We develop state-independent importance sampling based efficient simulation techniques
for two commonly encountered rare event probabilities associated with a random walk
$(S_n : n \geq 0)$ having i.i.d. regularly varying heavy-tailed increments; namely, the level
crossing probabilities when the increments of $S_n$ have a negative mean, and the large deviation
probabilities $P\{S_n > b\}$, as both $n$ and $b$ increase to infinity, for the zero mean random walk.
Exponential twisting based state-independent methods, which are effective in efficiently
estimating these probabilities for light-tailed increments, are not applicable when the increments
are heavy-tailed. To address the latter case, more complex and elegant state-dependent efficient
simulation algorithms have been developed in the literature over the last few years. We
propose that by suitably decomposing these rare event probabilities into a dominant and
further residual components, simpler state-independent importance sampling algorithms can
be devised for each component, resulting in composite unbiased estimators with a desirable
vanishing relative error property. When the increments have infinite variance, there is an
added complexity in estimating the level crossing probabilities, as even the well known zero
variance measures have an infinite expected termination time. We adapt our algorithms so
that this expectation is finite while the estimators remain strongly efficient. Numerically,
the proposed estimators perform at least as well as, and sometimes substantially better than,
the existing state-dependent estimators in the literature.
1. Introduction



In this paper, we develop importance sampling algorithms involving simple, state-independent
changes of measure for the efficient estimation of large deviation probabilities and level crossing
probabilities of random walks with regularly varying heavy-tailed increments. Specifically, let
$(X_n : n \geq 1)$ denote a sequence of zero mean independent and identically distributed (i.i.d.)
random variables such that $P\{X_n > x\} = L(x)x^{-\alpha}$, for some $\alpha > 1$ and a slowly varying
function^1 $L(\cdot)$. Note that $\alpha > 2$ ensures finite variance for $X_n$, whereas $\alpha < 2$ implies that it
has infinite variance. Set $S_0 := 0$ and $S_n := X_1 + \ldots + X_n$, for $n \geq 1$. Given $\mu > 0$, define
$M := \sup_n (S_n - n\mu)$ and $\tau_b := \inf\{n \geq 0 : S_n - n\mu > b\}$. We are interested in the importance
sampling based efficient estimation of:

1. Large deviation probabilities $P\{S_n > b\}$ for $b > n^{\beta + \epsilon}$ with $\beta := (\alpha \wedge 2)^{-1}$, as $n \nearrow \infty$, and

2. Level crossing probabilities $P\{\tau_b < \infty\}$, or equivalently, the tail probabilities $P\{M > b\}$,
as $b \nearrow \infty$.



* Corresponding author. E-mail address: kanrnrthy@tif r . res . in. 

1 That is, $\lim_{x \to \infty} L(tx)/L(x) = 1$ for any $t > 0$; prominent examples of slowly varying functions include
$(\log x)^{\beta}$ for any $\beta \in \mathbb{R}$.
For brevity, we refer to the former as large deviations probabilities and the latter as level crossing
probabilities. Our methodology for estimating the large deviations probabilities easily extends to
the efficient estimation of $P\{S_N > u\}$ for a random $N$, when $N$ is light-tailed^2 and independent
of the increments $\{X_n\}$ (popular in the literature are $N$ fixed or geometrically distributed), as $u \nearrow \infty$.
However, in the interest of space, we do not explicitly consider the 'random sum tail probabilities' 
estimation problem in this paper. 

Importance sampling via appropriate change of measure has been extremely successful in 
efficiently simulating rare events, and has been studied extensively in both the light and heavy 
tailed settings (see, e.g., Asmussen and Glynn (2007) for an introduction to rare event simulation 
and applications). In importance sampling for random walks, state-dependence essentially 
means that the sampling distribution for generating the increment $X_k$ depends on the realized
values of $X_1, \ldots, X_{k-1}$ (typically, through $S_{k-1}$); state-independence on the other hand implies
that samples of $X_1, \ldots, X_n$ can be drawn independently. State-independent methods often enjoy
advantages over state-dependent ones in terms of complexity of generating samples and ease
of implementation. The zero-variance changes of measure for estimating the large deviations
and the level crossing probabilities are well known and are state-dependent (see, e.g., Juneja
and Shahabuddin 2006). While typically unimplementable, they provide guidance in the search for
implementable approximately zero variance importance sampling techniques.

In the light-tailed settings, large deviations analysis can be used to show that exponential 
twisting based state-independent importance sampling well approximates the zero variance mea- 
sure (see, e.g., Asmussen and Glynn (2007)) and also efficiently estimates the large deviations 
as well as level crossing probabilities (see, e.g., Sadowsky and Bucklew (1990) and Siegmund 
(1976)). However, development of state independent techniques for these probabilities is harder 
in the heavy-tailed settings. Asmussen et al. (2000) provide an account of failure of simple large 
deviations based simulation methods that approximate zero-variance measure in heavy-tailed 
systems. Bassamboo et al. (2007) prove that no state-independent importance sampling change
of measure can efficiently simulate the level crossing probability in a busy cycle of a heavy-tailed
random walk. The fact that the zero-variance measures for estimating both the large deviations
and the level crossing probabilities are state-dependent, and the above mentioned negative re- 
sults, have motivated research over the last few years in development of complex and elegant 
state-dependent algorithms to efficiently estimate these probabilities (see, e.g., Dupuis et al. 
(2007), Blanchet and Glynn (2008), Blanchet and Liu (2008, 2012), and Chan et al. (2012)). 

In this paper we introduce simple state-independent changes of measure to estimate the large
deviations and the level crossing probabilities with regularly varying increments. We show that
the proposed methods are provably efficient^3 and perform at least as well as the existing
state-dependent algorithms. Thus our key contribution is to question the prevailing view that one
needs to resort to state-dependent methods for efficient computation of rare event probabilities
involving a 'large number' of heavy-tailed random variables. A key idea to be exploited in the
estimation of the probabilities considered is the fact that the corresponding rare event occurrence
is governed by the "single big jump" principle, that is, the most likely paths leading to the
occurrence of the rare event have one of the increments taking a large value (see, e.g., Foss
et al. (2011) and the references therein). Our approach for estimating the large deviations

2 As is well-known, $X$ is light-tailed if the moment generating function $E[\exp(\theta X)]$ is finite for some $\theta > 0$,
and is heavy-tailed otherwise.

3 We show that the estimators have asymptotically vanishing relative error; this corresponds to their coefficient
of variation converging to zero as the event becomes rarer. We also have a related weaker notion of strong efficiency 
where the coefficient of variation of the estimators, and subsequently the number of i.i.d. replications required, 
remains bounded as the event becomes rarer. Weak efficiency is another standard notion of performance in rare 
event simulation corresponding to a slow increase in the number of replications required as the event becomes 
rarer. These are briefly reviewed in Section 2.2. 
probability $P\{S_n > b\}$ relies on decomposing it into a dominant and a residual component, and
developing efficient estimation techniques for both. For estimating the level crossing probability
$P\{\tau_b < \infty\}$, in addition to such a decomposition, we partition the event of interest into several
blocks that are sampled using appropriate randomization. When the increments have infinite 
variance, there is an added complexity in estimating the level crossing probabilities as even 
the well known zero variance measure is known to have an infinite expected termination time. 
We modify our algorithms so that this expectation remains finite while the estimator remains 
strongly efficient although it may no longer have asymptotically vanishing relative error. 
Our specific contributions are as follows: 

1. We provide importance sampling estimators that achieve asymptotically vanishing relative
error for the estimation of $P\{S_n > b\}$, as $n \nearrow \infty$. Given $n$ and $\epsilon > 0$, our simulation
methodology is uniformly efficient for values of $b$ larger than $n^{1/2 + \epsilon}$ when the increments $X_n$
have finite variance, and for $b > n^{1/\alpha + \epsilon}$ in the case of increments having infinite variance,
thus operating throughout the large deviations regime where the well-known asymptotics
$P\{S_n > b\} \sim n\bar{F}(b)$, with $\bar{F}(b) := P\{X_n > b\}$, hold. Further, this is the first instance that we are aware of
where efficient simulation techniques for the large deviations probability include the case 
of increments having infinite variance, which is not uncommon in practical applications 
involving heavy-tailed random variables. 

2. For $\alpha > 1$, we develop unbiased estimators for level crossing probabilities $P\{\tau_b < \infty\}$ that
achieve vanishing relative error as $b \nearrow \infty$. These estimators require an overall computational
effort that scales as $O(b)$ when the variance of $X_n$ is finite. This is similar to the
complexity of the zero variance operator since, as is well known, the latter requires order
$E[\tau_b \mid \tau_b < \infty]$ computation in generating a single sample and this is known to be linear
in $b$ when the variance of increments is finite (see Asmussen and Kluppelberg (1996)).
However, since $E[\tau_b \mid \tau_b < \infty] = \infty$ for the case of increments having infinite variance,
even the zero-variance measure (even if implementable) is no longer viable because from
a computational standpoint, any useful estimator needs to have finite expected replication
termination time. For random walks with infinite variance increments, we develop
algorithms such that: 

(a) For $\alpha > 1.5$, the associated estimators are strongly efficient and have $O(b)$ expected
termination time. As a converse, we also prove that for $\alpha < 1.5$ no algorithm can be
devised in our framework that has both the variance and expected termination time
simultaneously finite. The situation is more nuanced when $\alpha = 1.5$ and depends on
the form of the slowly varying function $L(\cdot)$.

(b) For $\alpha < 1.5$, each replication of the estimator terminates in $O(b)$ time on average
and we require only $O(1)$ replications, thus resulting in an overall complexity of $O(b)$.
The relative deviation (the ratio of the absolute difference between the estimator and 
the true value with the true value) of the values returned by the algorithm is well 
within the specified limits with high probability, even though the estimator variance 
is infinite. 

The above results for infinite increment variance, and in particular the bottleneck arising 
at a = 1.5, closely mirror the results proved in Blanchet and Liu (2012) where vastly 
different state-dependent algorithms are considered. 

A brief discussion on practical applications and a literature review may be in order: Effi- 
cient estimation of the level crossing probability is important in many practical contexts, e.g., in 
computing steady state probability of large delays in GI/GI/1 queues and in ruin probabilities
in insurance settings (see, e.g., Asmussen and Glynn (2007)). Siegmund (1976) provides the 
first weakly efficient importance sampling algorithm for estimating the level crossing probabili- 
ties when the increments X n are light-tailed using large deviations based exponentially twisted 
change of measure. Sadowsky and Bucklew (1990) develop a weakly efficient algorithm for 
estimating $P\{S_n > na\}$ for $a > 0$ and $X_i$ light-tailed, again using exponential twisting based
importance sampling distribution (also see Sadowsky (1996), Dupuis and Wang (2004), Blanchet 
et al. (2009), Dieker and Mandjes (2005) and Agarwal et al. (2013) for related analysis). This 
problem is important mainly because it forms a building block to many more complex rare event 
problems involving combination of renewal processes: for examples in queueing, see Parekh and 
Walrand (1989) and in financial credit risk modeling, see Glasserman and Li (2005) and Bassam- 
boo et al. (2008). Research on efficient simulation of rare events involving heavy-tailed variables 
first focussed on probabilities such as $P\{S_N > b\}$ in the simpler asymptotic regime where $N$ is
fixed or geometrically distributed and $b \nearrow \infty$. In this simpler setting state-independent
algorithms are easily designed (see, e.g., Asmussen et al. (2000), Juneja and Shahabuddin (2002),
Asmussen and Kroese (2006)). In Rajhaa and Juneja (2012), it is shown that a variant of capped
exponential twisting based state-independent importance sampling, which does not involve any
decomposition, provides a strongly efficient estimator for the large deviations probability that 
we consider in this paper. 

Statistical analysis reveals that heavy-tailed distributions are very common in practice: in 
particular, heavy-tailed increments with infinite variance are a convenient means to explain the 
long-range dependence observed in tele-traffic data, and to model highly variable claim sizes 
in insurance settings. Popular references to this strand of literature include Embrechts et al. 
(1997), Resnick (1997), and Adler et al. (1998). 

The organization of the remaining paper is as follows: In Section 2 we discuss preliminary 
concepts relevant to the problems addressed. We propose our importance sampling method for 
estimating the large deviations probability and prove its efficiency in Section 3. In Section 4, 
we develop algorithms for estimating the level crossing probability. Proofs of some of the key 
results pertaining to efficiency of proposed algorithms and their expected termination time are 
given in Section 5. Numerical experiments supporting our algorithms are given in Section 6 
followed by a brief conclusion in Section 7. Some of the more technical proofs are presented in 
the appendix. 

2. Preliminary Background 

In this section we briefly review the use of importance sampling in estimating rare event proba- 
bilities, and the well-known asymptotics for relevant tail probabilities in the existing literature. 
Throughout this paper, we use Landau's notation for describing asymptotic behaviour of
functions: for given functions $f : \mathbb{R}^+ \to \mathbb{R}^+$ and $g : \mathbb{R}^+ \to \mathbb{R}^+$, we say $f(x) = O(g(x))$ if there exist
$c_1 > 0$ and $x_1$ large enough such that $f(x) \leq c_1 g(x)$ for all $x > x_1$; and $f(x) = \Omega(g(x))$ if there
exist $c_2 > 0$ and $x_2$ large enough such that $f(x) \geq c_2 g(x)$ for all $x > x_2$. We use $f(x) = o(g(x))$
if $f(x)/g(x) \to 0$, and $f(x) \sim g(x)$ if $f(x)/g(x) \to 1$, as $x \nearrow \infty$.

2.1. Rare event simulation and importance sampling. Let $A$ denote a rare event on
the probability space $(\Omega, \mathcal{F}, P)$, i.e., $z := P(A) > 0$ is small (in our setup $A$ corresponds to the
events $\{S_n > b\}$ or $\{\tau_b < \infty\}$). Suppose that we are interested in obtaining an estimator $\hat{z}$
for $z$ such that the relative deviation $|\hat{z} - z|/z \leq \epsilon$, with probability at least $1 - \delta$, for given $\epsilon$
and $\delta > 0$. Naive simulation for estimating $z$ involves drawing $N$ independent samples of the
indicator $I_A$ and taking their sample mean as the estimator. For a different measure $\tilde{P}(\cdot)$ such
that the Radon-Nikodym derivative $dP/d\tilde{P}$ is well defined on $A$, we get:

$$P(A) = \int_A \frac{dP}{d\tilde{P}}(\omega)\, d\tilde{P}(\omega) = \tilde{E}[L I_A],$$

where $L := dP/d\tilde{P}$ and $\tilde{E}[\cdot]$ is the expectation associated with $\tilde{P}(\cdot)$. Define $Z := L I_A$; then $Z$ is
an unbiased estimator of $z$ under the measure $\tilde{P}(\cdot)$. If $N$ i.i.d. samples $Z_1, \ldots, Z_N$ of $Z$ are drawn
from $\tilde{P}(\cdot)$, then by the strong law of large numbers we have:

$$\hat{z}_N := \frac{Z_1 + \ldots + Z_N}{N} \to z \quad \text{a.s.},$$

as $N \nearrow \infty$. This method of arriving at an estimator is called importance sampling (IS). The
measure $\tilde{P}(\cdot)$ is called the importance sampling measure and $Z$ is called an importance sampling
estimator. Using Chebyshev's inequality allows us to find an upper bound on the number of
replications $N$ required to achieve the desired relative precision:

$$\tilde{P}\{|\hat{z}_N - z| > \epsilon z\} \leq \frac{\mathrm{Var}(\hat{z}_N)}{\epsilon^2 z^2} = \frac{CV^2(Z)}{N\epsilon^2}.$$

Here $CV(Z) = \sqrt{\mathrm{Var}(Z)}/z$ is the coefficient of variation of $Z$. This enables us to conclude that
if we generate at least

$$N = \frac{CV^2(Z)}{\delta \epsilon^2} \tag{1}$$

i.i.d. samples of $Z$ for computing $\hat{z}_N$, we can guarantee the desired relative precision. In naive
simulation we use the measure $P(\cdot)$ itself and have $Z = I_A$ as the estimator; so the number
of samples required in (1) grows (roughly proportional to $z^{-1}$) to infinity if $z \searrow 0$. As is well
known, the choice $P^*(\cdot) := P(\cdot \mid A)$ as an importance sampling measure yields zero variance for
the associated estimator $Z = z I_A$ (see, e.g., Asmussen and Glynn (2007)). Then, every sample
obtained in simulation equals $z$ with $P^*(\cdot)$ probability 1. However, the explicit dependence of
$Z$ on $z$, the quantity which we want to estimate, makes this method impractical.
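
As a rough illustration of how a Chebyshev-type bound translates into a replication count, the following sketch computes the smallest integer $N$ with $CV^2(Z)/(N\epsilon^2) \leq \delta$, and evaluates it for naive simulation, where the indicator estimator has $CV^2 = (1 - z)/z$. The function names are hypothetical and introduced only for this example.

```python
import math


def replications_needed(cv_squared, eps, delta):
    """Smallest integer N with cv_squared / (N * eps**2) <= delta (Chebyshev bound)."""
    return math.ceil(cv_squared / (delta * eps ** 2))


def naive_cv_squared(z):
    """Squared coefficient of variation of the indicator I_A when P(A) = z."""
    return (1.0 - z) / z


if __name__ == "__main__":
    # The required sample size for naive simulation grows roughly like 1/z.
    for z in (1e-2, 1e-4, 1e-6):
        print(z, replications_needed(naive_cv_squared(z), eps=0.5, delta=0.25))
```

An importance sampling estimator whose coefficient of variation stays bounded keeps this count bounded as $z$ decreases, which is the motivation for the efficiency notions discussed next.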

2.2. Efficiency notions of algorithms. Consider a family of events $\{A_n : n \geq 1\}$ such that
$z_n := P(A_n) \searrow 0$ as the rarity parameter $n \nearrow \infty$. For an importance sampling algorithm to
compute $(z_n : n \geq 1)$, we come up with a sequence of changes of measure $(\tilde{P}_n(\cdot) : n \geq 1)$ and
estimators $(Z_n : n \geq 1)$ such that $\tilde{E}_n Z_n = z_n$, where $\tilde{E}_n[\cdot]$ denotes the expectation operator
under $\tilde{P}_n(\cdot)$.

Definition 1. The sequence $(Z_n : n \geq 1)$ of unbiased importance sampling estimators of $(z_n :
n \geq 1)$ is said to achieve asymptotically vanishing relative error if

$$\limsup_{n \to \infty} \frac{\tilde{E}_n[Z_n^2]}{z_n^2} \leq 1. \tag{2}$$

The sequence $(Z_n : n \geq 1)$ is said to be strongly efficient if

$$\limsup_{n \to \infty} \frac{\tilde{E}_n[Z_n^2]}{z_n^2} < \infty, \tag{3}$$

and weakly efficient if for all $\epsilon > 0$,

$$\limsup_{n \to \infty} \frac{\tilde{E}_n[Z_n^2]}{z_n^{2 - \epsilon}} < \infty. \tag{4}$$
The significance of these definitions can be seen from (1): if an algorithm is strongly
efficient, the number of simulation runs required to guarantee the desired relative precision stays
bounded as $n \nearrow \infty$. If $\mathrm{Var}(Z_n) = o(z_n^2)$, then $(Z_n : n \geq 1)$ satisfies the asymptotically vanishing
relative error property. As a result, it is enough to generate $o(\delta^{-1}\epsilon^{-2})$ i.i.d. replications of
the estimator. As is apparent from the definition, all strongly efficient algorithms are weakly 
efficient, and vanishing relative error is the strongest notion among all three. Also it can be 
verified that naive simulation is not even weakly efficient. 
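
The last claim can be checked in one line. The computation below is a sketch that takes weak efficiency in its usual formulation, namely boundedness of the ratio of the second moment to $z_n^{2-\epsilon}$ for every $\epsilon > 0$. For naive simulation $Z_n = I_{A_n}$, so $Z_n^2 = Z_n$ and the second moment equals $z_n$:

```latex
\frac{E[Z_n^2]}{z_n^{2-\epsilon}}
  = \frac{z_n}{z_n^{2-\epsilon}}
  = z_n^{-(1-\epsilon)} \longrightarrow \infty
  \quad \text{as } z_n \searrow 0, \text{ for any } \epsilon \in (0, 1).
```

Hence the ratio is unbounded for every $\epsilon \in (0,1)$, and the weak efficiency criterion fails.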

2.3. Related asymptotics. In this section, we list the well-known asymptotics of the 
quantities of interest; these asymptotic representations will be useful for arriving at importance 
sampling measures and proving their efficiency. 

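1. Recall that $\beta := (\alpha \wedge 2)^{-1}$. Then we have,

$$P\{S_n > b\} \sim n\bar{F}(b), \quad \text{as } n \nearrow \infty, \tag{5}$$

for $b > n^{\beta + \epsilon}$, $\epsilon > 0$ (see Mikosch and Nagaev (1998) and references therein). Additionally,
the following relations can be found in Mikosch and Nagaev (1998): as $n \nearrow \infty$,

$$P\left\{S_n > b, \max_{k \leq n} X_k < b\right\} = o(n\bar{F}(b)), \tag{6}$$

$$\sup_{b > n^{\beta + \epsilon}} \left| P\{\#\{1 \leq i \leq n : X_i > b\} = 1 \mid S_n > b\} - 1 \right| = o(1), \quad \text{and}$$

$$\sup_{b > n^{\beta + \epsilon}} \left| P\left\{\max_{k \leq n-1} X_k < b, S_n > b \,\Big|\, X_n > b\right\} - 1 \right| = o(1).$$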



These large deviations asymptotics reveal that with the number of summands growing 
to infinity, with high probability, the sum becomes large because one of the component 
increments becomes large. 

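2. Recall that $\tau_b := \inf\{k : S_k > b + k\mu\}$ and $M := \sup_n (S_n - n\mu)$; the events $\{M > b\}$ and
$\{\tau_b < \infty\}$ are the same. Let $F_I(\cdot)$ denote the integrated tail of $F(\cdot)$ as below:

$$F_I(x) := \int_x^{\infty} \bar{F}(u)\,du, \quad \text{for } x \geq 0.$$

The following asymptotics are well-known (see, e.g., Foss et al. (2011) and references
therein). As $b \nearrow \infty$, uniformly for any positive integer $n$,

$$P\{\tau_b \leq n\} \sim \frac{1}{\mu} \int_b^{b + n\mu} \bar{F}(u)\,du, \quad \text{and} \quad P\{\tau_b < \infty\} \sim \frac{1}{\mu} F_I(b). \tag{7}$$

Then for any positive integers $n_1$ and $n_2$ with $n_1 < n_2$,

$$P\{n_1 < \tau_b \leq n_2\} \sim \frac{1}{\mu} \int_{b + n_1\mu}^{b + n_2\mu} \bar{F}(u)\,du = \frac{F_I(b + n_1\mu) - F_I(b + n_2\mu)}{\mu}, \tag{8}$$

uniformly in $n_1$ and $n_2$, as $b \nearrow \infty$.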

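Further, the following characterization of the zero-variance measure $P\{\cdot \mid \tau_b < \infty\}$, as in
Theorem 1.1 of Asmussen and Kluppelberg (1996), sheds light on how the first passage
over a level $b$ happens asymptotically: If we use $a(b) := F_I(b)/\bar{F}(b)$, then conditional on
$\tau_b < \infty$,

$$\left( \frac{\tau_b}{a(b)}, \left( \frac{S_{\lfloor u\tau_b \rfloor} - \lfloor u\tau_b \rfloor \mu}{\tau_b} : 0 \leq u < 1 \right), \frac{S_{\tau_b} - \tau_b\mu - b}{a(b)} \right) \Longrightarrow \left( \frac{Y_0}{\mu}, (-u\mu : 0 \leq u < 1), Y_1 \right) \tag{9}$$

in $\mathbb{R} \times D[0, 1) \times \mathbb{R}$. The joint law of $(Y_0, Y_1)$ is defined as follows: for $y_0, y_1 \geq 0$,
$P\{Y_0 > y_0, Y_1 > y_1\} = P\{Y_1 > y_0 + y_1\}$ with $Y_0 \stackrel{d}{=} Y_1$, and

$$P\{Y_1 > y_1\} = \frac{1}{\left(1 + y_1/(\alpha - 1)\right)^{\alpha - 1}}.$$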

Now we state a part of Karamata's theorem that provides an asymptotic characterization of 
the integrated tails of regularly varying functions: Consider a regularly varying function V(-) 
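with index $-\alpha$; if $\beta$ is such that $\alpha - \beta > 1$,

$$\int_x^{\infty} u^{\beta} V(u)\,du \sim \frac{x^{\beta + 1} V(x)}{\alpha - \beta - 1}, \quad \text{as } x \nearrow \infty. \tag{10}$$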

See Embrechts et al. (1997) or Borovkov and Borovkov (2008) for further details. 

3. Simulation of $\{S_n > b\}$

Let $X$ be a zero mean random variable with distribution $F(\cdot)$ satisfying the following:

Assumption 1. The tail probabilities are given by $\bar{F}(x) := P\{X > x\} = x^{-\alpha}L(x)$, for some
slowly varying function $L(\cdot)$ and $\alpha > 1$. If $\mathrm{Var}[X] = \infty$, then

$$\limsup_{x \to \infty} \frac{P\{X < -x\}}{P\{X > x\}} < \infty.$$

For the independent collection $(X_n : n \geq 1)$ of random variables which are distributed identically
as $X$, define the random walk $(S_n : n \geq 0)$ as below:

$$S_0 = 0, \quad \text{and} \quad S_n = X_1 + \ldots + X_n \text{ for } n \geq 1.$$

In this section we devise a simulation procedure for estimating the large deviation probabilities
$P\{S_n > b\}$ for $b > n^{\beta + \epsilon}$ given any $\epsilon > 0$, and prove its efficiency as $n \nearrow \infty$. Recall that
$\beta := (\alpha \wedge 2)^{-1}$. The strategy is to partition the event $\{S_n > b\}$ based on whether the maximum
of the increments $\{X_1, \ldots, X_n\}$ has exceeded the large value $b$ or not (Juneja (2007) considers
this approach when $n$ is fixed):

$$A_{dom}(n, b) := \left\{ S_n > b, \max_{k \leq n} X_k \geq b \right\} \quad \text{and} \quad A_{res}(n, b) := \left\{ S_n > b, \max_{k \leq n} X_k < b \right\}.$$

The asymptotics (5) and (6) in Section 2.3 indicate that for large values of $n$, the most likely
way for the sum $S_n$ to exceed $b$ is to have at least one of the increments $\{X_1, \ldots, X_n\}$ exceed
$b$. Hence the probability of the event $A_{res}$ is vanishingly small compared to the probability of
$A_{dom}$, as $n \nearrow \infty$; the suffixes indicate that $A_{dom}$ is the dominant way of occurrence of
$\{S_n > b\}$ for large $n$, and the other event has only residual contributions. We estimate $P(A_{dom})$
and $P(A_{res})$ independently via different changes of measure that typify the way in which the
respective events occur, and add the individual estimates to arrive at the final estimator for
$P\{S_n > b\}$.
3.1. Simulating $A_{dom}$. For the simulation of $A_{dom}$ we follow the two-step procedure outlined
in Chan et al. (2012):

1. Choose an index $I$ uniformly at random from $\{1, \ldots, n\}$.

2. For $k = 1, \ldots, n$, generate a realization of $X_k$ from $F(\cdot \mid X_k > b)$ if $k = I$; otherwise,
generate $X_k$ from $F(\cdot)$.

Let $\tilde{P}(\cdot)$ denote the measure induced when the increments are generated according to the above
procedure, and let $\tilde{E}[\cdot]$ denote the corresponding expectation operator. Note that the probability
measure $P(\cdot)$ is absolutely continuous with respect to $\tilde{P}(\cdot)$ when restricted to $A_{dom}$. We have,

$$d\tilde{P}(x_1, \ldots, x_n) = \sum_{i=1}^{n} \frac{1}{n} \cdot \frac{dP(x_1, \ldots, x_n)}{\bar{F}(b)}\, \mathbf{1}(x_i > b).$$

Therefore the likelihood ratio on the set $A_{dom}$ is given by,

$$\frac{dP}{d\tilde{P}}(X_1, \ldots, X_n) = \frac{n\bar{F}(b)}{\#\{X_i > b : 1 \leq i \leq n\}},$$

and the resulting unbiased estimator for the evaluation of $P(A_{dom})$ is,

$$Z_{dom}(n, b) := \frac{n\bar{F}(b)}{\#\{X_i > b : 1 \leq i \leq n\}}\, I(A_{dom}(n, b)).$$

Generate $N$ independent realizations of $Z_{dom}$ and take their sample mean as an estimator
of $P(A_{dom})$. To evaluate how large $N$ should be chosen so that the computed estimate satisfies
the given relative error specification, we need to obtain bounds on the variance of $Z_{dom}$. Since
$\#\{X_i > b : 1 \leq i \leq n\}$ is at least 1 when the increments are drawn following the measure $\tilde{P}(\cdot)$,
we have $Z_{dom}(n, b) \leq n\bar{F}(b)$, and hence,

$$\tilde{E}[Z_{dom}^2(n, b)] \leq (n\bar{F}(b))^2.$$

Also $\tilde{E}[Z_{dom}(n, b)] = P(A_{dom}(n, b)) \sim P\{S_n > b\} \sim n\bar{F}(b)$, as $n \nearrow \infty$. Therefore we get,

$$\mathrm{Var}[Z_{dom}(n, b)] = o\left((n\bar{F}(b))^2\right), \quad \text{as } n \nearrow \infty. \tag{12}$$
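
The two-step procedure for the dominant event can be sketched in a few lines. The example below is only an illustration under an assumed increment model that is not taken from the text: $X = Y - m$ with $Y$ a Pareto variable satisfying $P\{Y > y\} = y^{-\alpha}$ for $y \geq 1$ and $m = E[Y] = \alpha/(\alpha - 1)$, so that $X$ has zero mean and a regularly varying right tail. The helper names `tail`, `sample_x` and `z_dom` are hypothetical.

```python
import random


def tail(x, alpha):
    """P{X > x} for X = Y - m, with P{Y > y} = y**(-alpha) on [1, inf), m = E[Y]."""
    m = alpha / (alpha - 1.0)
    return 1.0 if x < 1.0 - m else (x + m) ** (-alpha)


def sample_x(rng, alpha, above=None):
    """Draw X from F, or from F(. | X > above) when `above` is given."""
    m = alpha / (alpha - 1.0)
    u = 1.0 - rng.random()                      # uniform on (0, 1]
    scale = 1.0 if above is None else max(above + m, 1.0)
    return scale * u ** (-1.0 / alpha) - m      # inverse-transform for a Pareto tail


def z_dom(n, b, alpha, rng):
    """One replication of the estimator for P(A_dom) under the mixture measure."""
    big = rng.randrange(n)                      # step 1: uniform index I
    xs = [sample_x(rng, alpha, above=b if k == big else None) for k in range(n)]
    if sum(xs) <= b:                            # indicator of {S_n > b}
        return 0.0
    # Number of increments exceeding b; the designated one always counts.
    count = 1 + sum(1 for k, x in enumerate(xs) if k != big and x > b)
    return n * tail(b, alpha) / count


if __name__ == "__main__":
    rng = random.Random(0)
    n, b, alpha = 50, 200.0, 2.5
    reps = [z_dom(n, b, alpha, rng) for _ in range(10000)]
    print(sum(reps) / len(reps), n * tail(b, alpha))
```

Every replication is bounded by $n\bar{F}(b)$, which is the property used in the variance bound above; the sample mean estimates only the dominant part $P(A_{dom})$ and would be combined with a separate estimate of $P(A_{res})$.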

3.2. Simulating $A_{res}$. We see that all the increments $\{X_1, \ldots, X_n\}$ are bounded from above
by $b$ on the occurrence of the event $A_{res}$. Though the bound on the increments varies with $n$, we can
employ methods similar to exponential twisting of light-tailed random walks to simulate the
event $A_{res}$, as illustrated in this section. For given $b$, define

$$\Lambda_b(\theta) := \log\left( \int_{-\infty}^{b} \exp(\theta x)\, F(dx) \right), \quad \theta \geq 0.$$

Since the upper limit of integration is $b$, $\Lambda_b(\cdot)$ is well-defined for any positive value of $\theta$. For given
values of $n$ and $b$, consider the distribution function $F_\theta(\cdot)$ satisfying,

$$\frac{dF_\theta}{dF}(x) = \exp\left(\theta_{n,b}\, x - \Lambda_b(\theta_{n,b})\right) \mathbf{1}(x < b),$$

for all $x \in \mathbb{R}$ and some $\theta_{n,b} > 0$. Now the prescribed procedure is to just obtain independent
samples of the increments $\{X_1, \ldots, X_n\}$ from $F_\theta(\cdot)$ and compute the likelihood ratio due to the
procedure of sampling from a different distribution $F_\theta(\cdot)$. Let $P_\theta(\cdot)$ and $E_\theta[\cdot]$ denote,
respectively, the corresponding importance sampling change of measure and its associated expectation
operator. Note that the dependence of $F_\theta(\cdot)$, $P_\theta(\cdot)$ and $E_\theta[\cdot]$ on $n$ and $b$ has been suppressed in
the notation. Then for given values of $n$ and $b$, we have the following unbiased estimator for
the computation of $P(A_{res})$:

$$Z_{res}(n, b) := \exp\left(-\theta_{n,b} S_n + n\Lambda_b(\theta_{n,b})\right) I(A_{res}). \tag{13}$$

Now generate independent replications of $Z_{res}$ and take their sample mean as the computed
estimate for $P(A_{res})$. However it remains to choose $\theta_{n,b}$. Since $S_n$ is larger than $b$ on $A_{res}$,

$$Z_{res}(n, b) \leq \exp\left(-\theta_{n,b}\, b + n\Lambda_b(\theta_{n,b})\right) I(A_{res}).$$

If we choose

$$\theta_{n,b} := -\frac{\log\left(n\bar{F}(b)\right)}{b}, \quad \text{then} \tag{14}$$

$$Z_{res}(n, b) \leq n\bar{F}(b) \exp\left(n\Lambda_b(\theta_{n,b})\right) I(A_{res}). \tag{15}$$
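
A quick numerical check of this choice of twisting parameter: with $\theta_{n,b} = -\log(n\bar{F}(b))/b$, the factor $\exp(-\theta_{n,b} b)$ equals $n\bar{F}(b)$ exactly, which is how the bound with prefactor $n\bar{F}(b)$ arises. The sketch below assumes only that the tail value $\bar{F}(b)$ is available as a number; the function name `twist_parameter` is hypothetical.

```python
import math


def twist_parameter(n, b, tail_b):
    """theta_{n,b} = -log(n * Fbar(b)) / b; positive whenever n * Fbar(b) < 1."""
    return -math.log(n * tail_b) / b


if __name__ == "__main__":
    n, b = 100, 1000.0
    tail_b = b ** (-2.5)                 # illustrative regularly varying tail value
    theta = twist_parameter(n, b, tail_b)
    # exp(-theta * b) recovers n * Fbar(b), the prefactor in the bound on Z_res.
    print(theta, math.exp(-theta * b), n * tail_b)
```

The parameter is small when $b$ is large, so the twisted distribution stays close to the original one on the region $x < b$.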

We use Lemma 1, which is proved in the appendix, to obtain an upper bound on the second
moment of the estimator $Z_{res}$.

Lemma 1. For the choice of $\theta_{n,b}$ as in (14),

$$\exp\left(\Lambda_b(\theta_{n,b})\right) \leq 1 + \frac{1}{n}(1 + o(1)),$$

as $n \nearrow \infty$, uniformly for $b > n^{\beta + \epsilon}$.

Therefore there exists a constant $c$ such that

$$\exp\left(n\Lambda_b(\theta_{n,b})\right) \leq c,$$

for all admissible values of $n$ and $b$. We evaluate the second moment of the estimator $Z_{res}$ through
the equivalent expectation operation corresponding to the original measure $P(\cdot)$ as below:

$$E_\theta\left[Z_{res}^2(n, b)\right] = E\left[Z_{res}(n, b)\right] \leq c\, n\bar{F}(b) P(A_{res}),$$

where the last inequality follows from (15). Since $P(A_{res}) = o(n\bar{F}(b))$, as in (6), we obtain
that:

$$\mathrm{Var}[Z_{res}(n, b)] = o\left((n\bar{F}(b))^2\right), \quad \text{as } n \nearrow \infty, \tag{16}$$

thus arriving at the following theorem:

Theorem 1. If the realizations of the estimators $Z_{dom}$ and $Z_{res}$ are generated respectively from
the measures $\tilde{P}(\cdot)$ and $P_\theta(\cdot)$, and if we let,

$$Z(n, b) := Z_{dom}(n, b) + Z_{res}(n, b),$$

then under Assumption 1, the family of estimators $(Z(n, b) : n \geq 1, b > n^{\beta + \epsilon})$ achieves
asymptotically vanishing relative error for the estimation of $P\{S_n > b\}$, as $n \nearrow \infty$; that is,

$$\frac{\mathrm{Var}[Z(n, b)]}{\left(P\{S_n > b\}\right)^2} = o(1),$$

as $n \nearrow \infty$, uniformly for $b > n^{\beta + \epsilon}$.

Proof. Since the realizations of $Z_{dom}$ and $Z_{res}$ are generated independently of each other, the
variance of $Z$ is just the sum of the variances of $Z_{dom}$ and $Z_{res}$; the proof is now evident from (12),
(16) and (5). □

Remark 1. A consequence of the above theorem is that, due to (1), the number of i.i.d.
replications of $Z(n, b)$ required to achieve $\epsilon$-relative precision with probability at least $1 - \delta$ is at
most $o(\epsilon^{-2}\delta^{-1})$, independent of the rarity parameters $n$ and $b$. In our algorithm each replication
demands $O(n)$ computational effort, thus requiring an overall computational cost of $O(n)$, as
$n \nearrow \infty$.

Remark 2. One can easily check that this same simulation procedure can also be used to
efficiently compute the probabilities $P\{S_N > b\}$ when $N$ is a random variable independent of $\{X_i\}$.

4. Simulation Methodology for $\{\tau_b < \infty\}$

As before, the sequence $(S_n : n \geq 0)$ with $S_0 := 0$ and $S_n := X_1 + \ldots + X_n$ represents
the random walk associated with the i.i.d. collection $(X_n : n \geq 1)$. We have $E X_n = 0$, and
$P\{X_n > x\} = x^{-\alpha}L(x)$ for some slowly varying function $L(\cdot)$ and $\alpha > 1$. Given $\mu > 0$,
$M := \sup_n (S_n - n\mu)$. Since $(S_n - n\mu : n \geq 0)$ is a random walk with negative drift, the random
variable $M$ is proper. For $b > 0$, recall that the first-passage time $\tau_b := \inf\{n \geq 0 : S_n - n\mu > b\}$.
In this section we present simulation methods for the efficient computation of

$$P\{M > b\} = P\{\tau_b < \infty\}, \quad \text{as } b \nearrow \infty.$$

Naive simulation of $\{\tau_b < \infty\}$ will require generation of all the increments until the maximum
of the partial sums exceeds $b$. Due to the negative drift of the random walk $(S_n - n\mu : n \geq 0)$,
we have $\tau_b \nearrow \infty$ a.s. as $b \nearrow \infty$, and hence this method is not computationally feasible. To
counter the prospect of generating an uncontrollably large number of increment random variables
in simulation, we re-express $P\{\tau_b < \infty\}$ as below: Consider a strictly increasing sequence of
integers $(n_k : k \geq 0)$ with $n_0 = 0$; also fix $p := (p_k : k \geq 1)$ satisfying $p_k > 0$ for all $k$ and
$\sum_k p_k = 1$. The vector $p$ can be seen as a probability mass function on the positive integers. If
we consider an auxiliary random variable $K$ which takes the value of positive integer $k$ with
probability $p_k$, then we can write,

$$P\{\tau_b < \infty\} = \sum_{k \geq 1} p_k\, \frac{P\{n_{k-1} < \tau_b \leq n_k\}}{p_k} = E\left[ \frac{P\{n_{K-1} < \tau_b \leq n_K\}}{p_K} \right], \tag{17}$$

where $E[f(K)] = \sum_{k \geq 1} p_k f(k)$, for any $f : \mathbb{Z}^+ \to \mathbb{R}$.

Now in a simulation run, if the realized value of the auxiliary random variable $K$ is $k$,
generate a sample from a probability measure, possibly different from $P(\cdot)$, of a random variable
$Z_k$ that has $P\{n_{k-1} < \tau_b \leq n_k\}$ as its expectation under the changed measure. Then equation
(17) assures us that repeated simulation runs involving generation of $K$ according to the measure
induced by $p$, and taking the sample mean of such realizations of $Z_K/p_K$ following the changed
measure, will yield an unbiased estimator for the quantity $P\{\tau_b < \infty\}$.
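
The randomization over blocks can be sketched as follows. This is a minimal illustration, not the algorithm analysed in the paper: it assumes a probability vector `p` with finite support (the text allows infinitely many blocks), geometric block endpoints $n_k = r^k$, and a user-supplied function `block_estimator(k, rng)` returning an unbiased sample for the probability that $\tau_b$ falls in block $k$. All function names are hypothetical.

```python
import random


def block_endpoints(k, r):
    """Return (n_{k-1}, n_k) for blocks with n_0 = 0 and n_k = r**k, k >= 1."""
    return (0 if k == 1 else r ** (k - 1)), r ** k


def sample_block_index(p, rng):
    """Draw K with P{K = k} = p[k-1] for k = 1, ..., len(p)."""
    u, acc = rng.random(), 0.0
    for k, pk in enumerate(p, start=1):
        acc += pk
        if u < acc:
            return k
    return len(p)                        # guards against rounding in the cumulative sum


def randomized_estimate(block_estimator, p, rng):
    """One replication of Z_K / p_K, with K drawn from the mass function p."""
    k = sample_block_index(p, rng)
    return block_estimator(k, rng) / p[k - 1]


if __name__ == "__main__":
    rng = random.Random(0)
    q = [0.03, 0.02, 0.01]               # toy block probabilities, known exactly here
    p = [0.5, 0.3, 0.2]
    reps = [randomized_estimate(lambda k, _rng: q[k - 1], p, rng) for _ in range(20000)]
    print(sum(reps) / len(reps), sum(q))
```

In the toy run the block probabilities are known exactly, so the only randomness comes from $K$; the sample mean then approaches the sum of the block probabilities, mirroring the unbiasedness argument. Choosing $p_k$ close to proportional to the block probabilities keeps the ratio $Z_K/p_K$ nearly constant.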

The performance of any importance sampling algorithm following the outlined procedure
will depend crucially on the choice of probabilities $p_k$, and the change of measure employed to
estimate $P\{n_{k-1} < \tau_b \leq n_k\}$, for $k \geq 1$. The sequence $(n_k : k \geq 0)$ partitions the non-negative
integers into 'blocks' $((n_{k-1}, n_k] : k \geq 1)$. For reasons that will be clear later, we choose the
blocks $(n_{k-1}, n_k]$ in the following manner: Fix a positive integer $r > 1$ and let,

$$n_0 = 0, \quad n_k = r^k, \text{ for } k \geq 1.$$

In the following section, we detail the importance sampling schemes for the efficient
computation of the quantities $P\{n_{k-1} < \tau_b \leq n_k\}$, $k \geq 1$; these will be used as building blocks to
efficiently compute the ultimate object of interest $P\{\tau_b < \infty\}$.

4.1. Efficient simulation of {n k ^i < < n k }. In this section we present our state- 
independent importance sampling procedure, for the efficient computation of the probabilities 
P{rifc_i < Tb < n k }, that are uniformly efficient for k > 1. Define the following events: 

■n-k n k 

A k = |J {Xi >b + i/j,} and B k = p| {Xi < b + n k ^} . 
* =n fc-i+i «=i 

The events A k and B k are defined in the same spirit as that of A& ora and A res in the simulation 
of {S n > b} in Section 3: the event A k includes sample paths that have at least one "big" jump 
of appropriate size in one of the increments indexed between n k ~i and n k , whereas on the other 
set B k we have all the increments bounded from above. The following lemma, proved in the 
appendix, asserts that asymptotically A k is the most likely way for the event {n k -i < < n k } 
to happen. 

Lemma 2. For any $\epsilon > 0$, there exists $b_\epsilon$ such that for all $b > b_\epsilon$,

\[ \sup_{k \ge 1} \left| \frac{P\{n_{k-1} < \tau_b \le n_k, A_k\}}{P\{n_{k-1} < \tau_b \le n_k\}} - 1 \right| < \epsilon. \]


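As in the simulation of large deviation probabilities of sums of random variables in Section 3,
we can partition the event $\{n_{k-1} < \tau_b \le n_k\}$ into:

\[ \{n_{k-1} < \tau_b \le n_k, A_k\}, \quad \{n_{k-1} < \tau_b \le n_k, B_k\} \quad \text{and} \quad \{n_{k-1} < \tau_b \le n_k, \bar A_k \cap \bar B_k\}, \]

and arrive at unbiased estimators for their probabilities separately via different importance
sampling measures. Here $\bar A_k$ and $\bar B_k$ denote complements; the three events are disjoint because
$b + i\mu > b + n_{k-1}\mu$ for every $i > n_{k-1}$, so that $A_k$ and $B_k$ cannot occur together.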

4.1.1. Simulating $\{n_{k-1} < \tau_b \le n_k, A_k\}$. Let $q_k(b) := \sum_{i=n_{k-1}+1}^{n_k} \bar F(b + i\mu)$. We prescribe
the following two step procedure:

1. Choose an index $J \in \{n_{k-1} + 1, \ldots, n_k\}$ such that $\Pr\{J = n\} = \bar F(b + n\mu)/q_k(b)$, for
$n_{k-1} < n \le n_k$.

2. Simulate the increment $X_n$ from $F(\cdot \mid X_n > b + n\mu)$, if $n = J$; otherwise, simulate $X_n$ from
$F(\cdot)$, for any $n \le n_k$.
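The two steps above can be sketched in Python as follows. This is a minimal illustration under assumptions that are not taken from the paper: the increments are $X = V - E[V]$ with a Pareto-type tail $P\{V > t\} = (1+t)^{-\alpha}$, the crossing time is taken to be $\tau_b = \inf\{n : S_n > b + n\mu\}$, and the names `tail`, `draw` and `z_k1` are hypothetical. The function returns one replication of the estimator defined in (18), together with $q_k(b)$.

```python
import random

ALPHA = 2.5                  # tail index, chosen only for illustration
M = 1.0 / (ALPHA - 1.0)      # mean of V, so that X = V - M has zero mean

def tail(x):
    """P{X > x} for X = V - M, where P{V > t} = (1 + t)**(-ALPHA) for t >= 0."""
    return 1.0 if x <= -M else (1.0 + x + M) ** (-ALPHA)

def draw(rng, c=-M):
    """Inverse-transform draw of X given X > c (c = -M is an unconditional draw)."""
    u = 1.0 - rng.random()                      # uniform on (0, 1]
    return (1.0 + c + M) * u ** (-1.0 / ALPHA) - 1.0 - M

def z_k1(b, mu, n_prev, n_k, rng):
    """One replication of Z_{k,1}(b) on the block (n_prev, n_k]; also returns q_k(b)."""
    block = list(range(n_prev + 1, n_k + 1))
    weights = [tail(b + i * mu) for i in block]
    q = sum(weights)
    j = rng.choices(block, weights=weights)[0]             # step 1: pick the big-jump index
    xs = [draw(rng, b + i * mu) if i == j else draw(rng)   # step 2: conditional draw at J only
          for i in range(1, n_k + 1)]
    s, tau = 0.0, None
    for i, x in enumerate(xs, start=1):                    # assumed: tau_b = inf{n : S_n > b + n*mu}
        s += x
        if s > b + i * mu:
            tau = i
            break
    big = sum(1 for i in block if xs[i - 1] > b + i * mu)  # number of big jumps in the block
    hit = tau is not None and n_prev < tau <= n_k and big >= 1
    return (q / big if hit else 0.0), q

rng = random.Random(0)
runs = [z_k1(b=50.0, mu=1.0, n_prev=4, n_k=8, rng=rng) for _ in range(2000)]
estimate = sum(z for z, _ in runs) / len(runs)
print(estimate, runs[0][1])
```

Every replication is bounded by $q_k(b)$, which is the property the variance analysis of this estimator relies on.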

In this sampling procedure, we induce the 'big' jumps typically responsible for the occurrence
of $\{n_{k-1} < \tau_b \le n_k\}$ with suitable probabilities by sampling from the conditional distribution
$F(\cdot \mid X_J > b + J\mu)$. This sampling procedure results in the importance sampling measure $P_{k,1}(\cdot)$
characterised by:

\[ dP_{k,1}(x_1, \ldots, x_{n_k}) := \sum_{i=n_{k-1}+1}^{n_k} \frac{\bar F(b + i\mu)}{q_k(b)} \cdot \frac{F(dx_i)\, I(x_i > b + i\mu)}{\bar F(b + i\mu)} \prod_{j \ne i} F(dx_j). \]

This in turn yields a likelihood ratio,

\[ \frac{dP}{dP_{k,1}}(X_1, \ldots, X_{n_k}) = \frac{q_k(b)}{\#\{X_i > b + i\mu : n_{k-1} < i \le n_k\}}, \]

on the set $A_k$. Then we have,

\[ Z_{k,1}(b) := \frac{q_k(b)}{\#\{X_i > b + i\mu : n_{k-1} < i \le n_k\}}\, I(n_{k-1} < \tau_b \le n_k, A_k) \tag{18} \]

as the unbiased estimator for the quantity $P\{n_{k-1} < \tau_b \le n_k, A_k\}$. Here note that
$I(A_k) = 1$ a.s. under $P_{k,1}$.

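Lemma 3. Uniformly for $k \ge 1$,

\[ \mathrm{Var}[Z_{k,1}(b)] = o\left( (P\{n_{k-1} < \tau_b \le n_k\})^2 \right), \quad \text{as } b \nearrow \infty. \]

Proof. Since the quantity $\#\{X_i > b + i\mu : n_{k-1} < i \le n_k\}$ is at least 1 when the increments are
generated from $P_{k,1}(\cdot)$,

\[ Z_{k,1}(b) \le q_k(b), \]

and hence,

\[ E_{k,1}[Z_{k,1}^2] \le q_k^2(b). \tag{19} \]

We have,

\[ q_k(b) = \sum_{i=n_{k-1}+1}^{n_k} \bar F(b + i\mu) \le \sum_{i=n_{k-1}+1}^{n_k} \int_{i-1}^{i} \bar F(b + u\mu)\,du = \int_{n_{k-1}}^{n_k} \bar F(b + u\mu)\,du. \]

Changing variables from $u$ to $v = b + u\mu$ gives,

\[ q_k(b) \le \frac{1}{\mu} \int_{b + n_{k-1}\mu}^{b + n_k\mu} \bar F(v)\,dv. \]

Now given $\epsilon > 0$, because of (8) and (19),

\[ E_{k,1}[Z_{k,1}^2] \le (1 + \epsilon)\,(P\{n_{k-1} < \tau_b \le n_k\})^2 \]

for all $k$ and $b$ large enough. Also,

\[ P\{n_{k-1} < \tau_b \le n_k, A_k\} \ge (1 - \epsilon)\,P\{n_{k-1} < \tau_b \le n_k\}, \]

for all $k$, because of Lemma 2. Therefore,

\[ \mathrm{Var}[Z_{k,1}(b)] \le \left(1 + \epsilon - (1 - \epsilon)^2\right) (P\{n_{k-1} < \tau_b \le n_k\})^2 \le 3\epsilon\,(P\{n_{k-1} < \tau_b \le n_k\})^2. \quad \Box \]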






4.1.2. Simulating $\{n_{k-1} < \tau_b \le n_k, B_k\}$. On the event $B_k$, none of the random variables
$X_1, \ldots, X_{n_k}$ exceed the level $(b + n_{k-1}\mu)$; since these increments are bounded (on $B_k$), we can
draw their samples from an appropriately truncated, exponentially twisted variation of $F(\cdot)$
without losing absolute continuity on $\{n_{k-1} < \tau_b \le n_k, B_k\}$. For estimating
$P\{n_{k-1} < \tau_b \le n_k, B_k\}$, we draw samples of $X_1, \ldots, X_{\tau_b \wedge n_k}$ independently from the distribution
$F_k(\cdot)$ satisfying,

\[ \frac{dF_k(x)}{dF(x)} = \exp\left(\theta_k x - \Lambda_k(\theta_k)\right) I(x < b + n_{k-1}\mu), \quad x \in \mathbb{R}; \]

here,

\[ \theta_k\,(= \theta_k(b)) := -\frac{\log\left(n_k \bar F(b + n_{k-1}\mu)\right)}{b + n_{k-1}\mu}, \quad \text{and} \]

\[ \Lambda_k(\theta) := \log\left( \int_{-\infty}^{b + n_{k-1}\mu} \exp(\theta x)\,F(dx) \right), \quad \theta \ge 0. \tag{21} \]

Let $P_{k,2}(\cdot)$ be the measure induced by drawing samples as above. Then the resulting likelihood
ratio on $\{n_{k-1} < \tau_b \le n_k, B_k\}$ is:

\[ \frac{dP}{dP_{k,2}}(X_1, \ldots, X_{n_k}) = \exp\left(-\theta_k S_{\tau_b} + \tau_b \Lambda_k(\theta_k)\right). \]

The associated estimator for computing $P\{n_{k-1} < \tau_b \le n_k, B_k\}$ is:

\[ Z_{k,2}(b) := \exp\left(-\theta_k S_{\tau_b} + \tau_b \Lambda_k(\theta_k)\right) I(n_{k-1} < \tau_b \le n_k, B_k). \tag{22} \]

The following uniform bounds, which help in analyzing the variance of the estimator $Z_{k,2}$, are
proved in the appendix.

Lemma 4. There exists a positive constant $c_1$ such that, for all values of $k$ and $b$,

\[ \exp\left(n_k \Lambda_k(\theta_k)\right) \le c_1. \]

Lemma 5. There exists a positive constant $c_2$ such that, for all values of $k$ and $b$,

\[ \frac{n_k \bar F(b + n_{k-1}\mu)}{P\{n_{k-1} < \tau_b \le n_k\}} \le c_2. \]

Using these results, we now present an asymptotic analysis on the variance of the estimators
$Z_{k,2}(\cdot)$.

Lemma 6. Uniformly for $k \ge 1$,

\[ \mathrm{Var}[Z_{k,2}(b)] = o\left( (P\{n_{k-1} < \tau_b \le n_k\})^2 \right), \quad \text{as } b \nearrow \infty. \]

Proof. Since $n_k/r \le \tau_b \le n_k$ on the event $\{n_{k-1} < \tau_b \le n_k\}$,

\[ \exp\left(\tau_b \Lambda_k(\theta_k)\right) I(n_{k-1} < \tau_b \le n_k, B_k) \le c_1 \vee c_1^{1/r} =: c, \]

because of Lemma 4. Further note that $\theta_k S_{\tau_b} \ge -\log\left(n_k \bar F(b + n_{k-1}\mu)\right)$ on
$\{n_{k-1} < \tau_b \le n_k\}$. Therefore from (22),

\[ Z_{k,2}(b) \le c\,\left(n_k \bar F(b + n_{k-1}\mu)\right) I(n_{k-1} < \tau_b \le n_k, B_k), \quad \text{for all } k. \]

Now, changing the expectation operator in the evaluation of the second moment of the estimator
results in the following bound:

\[ E_{k,2}[Z_{k,2}^2(b)] = E[Z_{k,2}(b)] \le c\,\left(n_k \bar F(b + n_{k-1}\mu)\right) P\{n_{k-1} < \tau_b \le n_k, B_k\}. \]

Here we apply Lemma 2 for a bound on the probability term in the above expression. Given
$\epsilon > 0$, for all $b$ large enough, we have:

\[ E_{k,2}[Z_{k,2}^2(b)] \le c\,\left(n_k \bar F(b + n_{k-1}\mu)\right) \left(\epsilon\,P\{n_{k-1} < \tau_b \le n_k\}\right), \quad \text{for all } k. \]

Now using Lemma 5 we obtain,

\[ \mathrm{Var}[Z_{k,2}(b)] \le \epsilon\, c\, c_2\, (P\{n_{k-1} < \tau_b \le n_k\})^2. \quad \Box \]

4.1.3. Simulating $\{n_{k-1} < \tau_b \le n_k, \bar A_k \cap \bar B_k\}$. We draw samples in a two step procedure
similar to that in Section 4.1.1.

1. Choose an index $J$ uniformly at random from $\{1, \ldots, n_k\}$.

2. Simulate the increment $X_n$ from $F(\cdot \mid X_n > b + n_{k-1}\mu)$, if $n = J$; otherwise, simulate $X_n$
from $F(\cdot)$, for any $n \le n_k$.
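A sketch of this second two-step procedure, in the same hedged spirit as before: the Pareto-type increment distribution, the boundary $b + n\mu$ used for the crossing time, and the names `tail`, `draw` and `z_k3` are assumptions made for illustration only. The function returns one replication of the resulting estimator together with its deterministic upper bound $n_k \bar F(b + n_{k-1}\mu)$.

```python
import random

ALPHA = 2.5                  # tail index, chosen only for illustration
M = 1.0 / (ALPHA - 1.0)      # mean of V, so that X = V - M has zero mean

def tail(x):
    """P{X > x} for X = V - M, where P{V > t} = (1 + t)**(-ALPHA) for t >= 0."""
    return 1.0 if x <= -M else (1.0 + x + M) ** (-ALPHA)

def draw(rng, c=-M):
    """Inverse-transform draw of X given X > c (c = -M is an unconditional draw)."""
    u = 1.0 - rng.random()
    return (1.0 + c + M) * u ** (-1.0 / ALPHA) - 1.0 - M

def z_k3(b, mu, n_prev, n_k, rng):
    """One replication of the third estimator on the block (n_prev, n_k]."""
    level = b + n_prev * mu
    j = rng.randrange(1, n_k + 1)                          # step 1: J uniform on {1, ..., n_k}
    xs = [draw(rng, level) if i == j else draw(rng)        # step 2: conditional draw at J only
          for i in range(1, n_k + 1)]
    s, tau = 0.0, None
    for i, x in enumerate(xs, start=1):                    # assumed: tau_b = inf{n : S_n > b + n*mu}
        s += x
        if s > b + i * mu:
            tau = i
            break
    in_a = any(xs[i - 1] > b + i * mu for i in range(n_prev + 1, n_k + 1))
    count = sum(1 for x in xs if x > level)                # count >= 1 means B_k fails
    hit = tau is not None and n_prev < tau <= n_k and not in_a and count >= 1
    bound = n_k * tail(level)
    return (bound / count if hit else 0.0), bound

rng = random.Random(1)
runs = [z_k3(b=50.0, mu=1.0, n_prev=4, n_k=8, rng=rng) for _ in range(2000)]
print(sum(z for z, _ in runs) / len(runs), runs[0][1])
```

Most replications return zero here, since the event in question is a residual one; what matters for the variance bound is that no replication exceeds $n_k \bar F(b + n_{k-1}\mu)$.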

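If $P_{k,3}$ denotes the change of measure induced by drawing samples according to the above
procedure, then the likelihood ratio on the set $\{n_{k-1} < \tau_b \le n_k, \bar A_k \cap \bar B_k\}$ is:

\[ \frac{dP}{dP_{k,3}}(X_1, \ldots, X_{n_k}) = \frac{n_k \bar F(b + n_{k-1}\mu)}{\#\{X_i > b + n_{k-1}\mu : 1 \le i \le n_k\}}. \]

The resulting estimator for the computation of $P\{n_{k-1} < \tau_b \le n_k, \bar A_k \cap \bar B_k\}$ is:

\[ Z_{k,3}(b) := \frac{n_k \bar F(b + n_{k-1}\mu)}{\#\{X_i > b + n_{k-1}\mu : 1 \le i \le n_k\}}\, I(n_{k-1} < \tau_b \le n_k, \bar A_k \cap \bar B_k). \]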
Similar to Lemmas 3 and 6, we have the following result on the variance of $Z_{k,3}(\cdot)$:

Lemma 7. Uniformly for $k \ge 1$,

\[ \mathrm{Var}[Z_{k,3}(b)] = o\left( (P\{n_{k-1} < \tau_b \le n_k\})^2 \right), \quad \text{as } b \nearrow \infty. \]

Proof. When the increments are generated as prescribed in the above two-step procedure, we
have $\#\{X_i > b + n_{k-1}\mu : 1 \le i \le n_k\} \ge 1$, and hence,

\[ Z_{k,3}(b) \le n_k \bar F(b + n_{k-1}\mu)\, I(n_{k-1} < \tau_b \le n_k, \bar A_k \cap \bar B_k). \]

Now a bound on the second moment of the estimator can be obtained as before:

\[ E_{k,3}[Z_{k,3}^2(b)] = E[Z_{k,3}(b)] \le n_k \bar F(b + n_{k-1}\mu)\, P\{n_{k-1} < \tau_b \le n_k, \bar A_k \cap \bar B_k\}. \]

Given $\epsilon > 0$, due to application of Lemma 2, for all $k \ge 1$ and $b$ large enough, we have:

\[ E_{k,3}[Z_{k,3}^2(b)] \le n_k \bar F(b + n_{k-1}\mu) \left(\epsilon\,P\{n_{k-1} < \tau_b \le n_k\}\right). \]

Using Lemma 5, we write,

\[ \mathrm{Var}[Z_{k,3}(b)] \le \epsilon\, c_2\, (P\{n_{k-1} < \tau_b \le n_k\})^2, \]

thus establishing the claim. $\Box$

The estimator for $P\{n_{k-1} < \tau_b \le n_k\}$ can be obtained by summing the estimators of
$P\{n_{k-1} < \tau_b \le n_k, A_k\}$, $P\{n_{k-1} < \tau_b \le n_k, B_k\}$, and $P\{n_{k-1} < \tau_b \le n_k, \bar A_k \cap \bar B_k\}$:

\[ Z_k(b) := Z_{k,1}(b) + Z_{k,2}(b) + Z_{k,3}(b). \]

Since the random variables $\{Z_{k,j}(b), j = 1, 2, 3\}$ are generated independent of each other,

\[ \mathrm{Var}[Z_k(b)] = \mathrm{Var}[Z_{k,1}(b)] + \mathrm{Var}[Z_{k,2}(b)] + \mathrm{Var}[Z_{k,3}(b)] = o\left( (P\{n_{k-1} < \tau_b \le n_k\})^2 \right), \]

uniformly for $k \ge 1$, as $b \nearrow \infty$, because of Lemmas 3, 6, and 7. This yields the following
theorem:

Theorem 2. The family of estimators $\{Z_k(b) : k \ge 1, b > 0\}$ achieves asymptotically vanishing
relative error for the unbiased estimation of $P\{n_{k-1} < \tau_b \le n_k\}$, uniformly in $k$, as $b \nearrow \infty$; that is,

\[ \sup_{k \ge 1} \frac{\mathrm{Var}[Z_k(b)]}{(P\{n_{k-1} < \tau_b \le n_k\})^2} = o(1), \]

as $b \nearrow \infty$.



Remark 3. For our choice of importance sampling measures, the likelihood ratios resulting in
the simulation of $\{n_{k-1} < \tau_b \le n_k, B_k\}$ and $\{n_{k-1} < \tau_b \le n_k, \bar A_k \cap \bar B_k\}$ are $O(n_k \bar F(b + n_{k-1}\mu))$.
To have vanishing relative error, we need $P\{n_{k-1} < \tau_b \le n_k\}$ to be of the same order, which
happens when the choice of $(n_k : k \ge 0)$ is geometric, as shown in Lemma 5.

4.2. Simulation of $\{\tau_b < \infty\}$ - the finite variance case. Here we develop the ideas
stated at the beginning of Section 4. We have the increasing sequence of integers $(n_k : k \ge 0)$,

\[ n_0 = 0, \quad n_k = r^k \text{ for } k \ge 1, \]

for some integer $r > 1$. Further, we have an auxiliary random variable $K$ taking values in positive
integers according to the probability mass function $(p_k : k \ge 1)$. As in (17), we re-express the
quantity of interest as:

\[ P\{\tau_b < \infty\} = E\left[ \frac{P\{n_{K-1} < \tau_b \le n_K\}}{p_K} \right]. \]

From Section 4.1, we have estimators $\{Z_k(b) : k \ge 1\}$ that can be used to compute the corresponding
probabilities $\{P\{n_{k-1} < \tau_b \le n_k\} : k \ge 1\}$ in an efficient manner. Consider the
following simulation procedure:

1. Draw a sample of $K$ such that $\Pr\{K = k\} = p_k$.

2. Generate a realization of $Z_K(b)$ as in Section 4.1.

3. Return $Z_K(b)/p_K$.
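The three steps above can be sketched generically in Python. In this illustration the block estimator is passed in as a function, the support of $K$ is truncated to finitely many blocks, and a noiseless stand-in returning made-up block probabilities replaces $Z_k(b)$; all of these, along with the names `run_once` and `estimate`, are assumptions of the sketch and not part of the paper.

```python
import random

def run_once(p, block_estimator, rng):
    """One run: draw K with Pr{K = k} = p[k-1], generate Z_K, return Z_K / p_K."""
    k = rng.choices(range(1, len(p) + 1), weights=p)[0]
    return block_estimator(k, rng) / p[k - 1]

def estimate(p, block_estimator, n_runs, seed=0):
    """Sample mean and sample standard deviation over n_runs independent runs."""
    rng = random.Random(seed)
    vals = [run_once(p, block_estimator, rng) for _ in range(n_runs)]
    mean = sum(vals) / n_runs
    var = sum((v - mean) ** 2 for v in vals) / (n_runs - 1)
    return mean, var ** 0.5

# Hypothetical stand-in for Z_k(b): returns toy block probabilities without noise.
q = [4e-4, 3e-4, 2e-4, 1e-4]

def exact_block(k, rng):
    return q[k - 1]

p_matched = [qk / sum(q) for qk in q]   # p_k proportional to the block probability
p_flat = [0.25] * 4                     # a poorly matched alternative
mean_matched, sd_matched = estimate(p_matched, exact_block, 1000)
mean_flat, sd_flat = estimate(p_flat, exact_block, 1000)
print(mean_matched, sd_matched)
print(mean_flat, sd_flat)
```

With `p_matched` every run returns the same value, so the randomization over blocks adds no variance; with `p_flat` the estimate is still unbiased but noisier. This is the reason the probabilities $p_k$ are chosen to track the block probabilities.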

We present the sample mean of the values returned by $N$ independent simulation runs of the
above procedure as our final estimate of $P\{\tau_b < \infty\}$. Let $Q(\cdot)$ denote the probability measure in
the path space induced by the generation of increment random variables as a result of one run
of this sampling procedure; let $E^Q[\cdot]$ be the expectation operator associated with $Q(\cdot)$. Given
$b > 0$, the overall unbiased estimator for the computation of $P\{\tau_b < \infty\}$ is,

\[ Z(b) := \frac{Z_K(b)}{p_K}. \]

Note that the number of independent simulation runs needed to achieve a desired relative
precision, as in (1), is directly related to the sampling variance of $Z(b)$. If $(Z(b) : b > 0)$ offer
asymptotically vanishing relative error, we just need $o(\epsilon^{-2}\delta^{-1})$ independent replications of the
estimator. However, as pointed out in Hammersley and Handscomb (1965), and further justified in
Glynn and Whitt (1992), both the variance of an estimator and the expected computational
effort required to generate a single sample are important performance measures, and their
product can be considered as a 'figure of merit' in comparing performance of algorithms that
provide unbiased estimators of $P\{\tau_b < \infty\}$. For any given $b$, let $\nu_b$ denote the largest index
of the increment random variables ($X_j$'s) considered for simulation in a particular simulation
run. The expectation of $\nu_b$ then gives a measure of the expected number of increment random
variables generated, and subsequently of the expected computational effort in every simulation
run. In particular, the latter may be bounded from above by a constant $C > 0$ times the
expectation of $\nu_b$.
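To make the two performance measures concrete, the sketch below computes a replication count from a Chebyshev-type bound and the variance-times-cost figure of merit. It assumes that the precision criterion referred to as (1) is of the usual form "relative error at most $\epsilon$ with probability at least $1 - \delta$"; the function names and the numbers used are illustrative only.

```python
import math

def replications_needed(cv, eps, delta):
    """Chebyshev-type count: N >= cv**2 / (eps**2 * delta) guarantees that the sample
    mean of N i.i.d. replications has relative error at most eps with probability
    at least 1 - delta, where cv is the coefficient of variation of one replication."""
    return math.ceil(cv ** 2 / (eps ** 2 * delta))

def figure_of_merit(variance, expected_cost):
    """Variance of one replication times the expected effort to generate it."""
    return variance * expected_cost

n_runs = replications_needed(cv=0.25, eps=0.05, delta=0.05)
print(n_runs)
print(figure_of_merit(variance=2.0, expected_cost=3.0))
```

When the coefficient of variation stays bounded as $b$ grows, the replication count does not depend on $b$, so the overall effort is governed by the expected cost per replication.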

In a single run of the above procedure, if the realized value of $K$ is $k$, we look for estimating
$P\{n_{k-1} < \tau_b \le n_k\}$, which does not entail the generation of more than $n_k$ increment random
variables, thus ensuring termination. In particular, $n_{K-1} < \nu_b \le n_K$. The following theorems
give a measure of both the variance and the expected computational effort per replication of
$Z(b)$ for a specific choice of the probabilities $p_k$.

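Theorem 3. For

\[ p_k = \frac{\bar F_I(b + n_{k-1}\mu) - \bar F_I(b + n_k\mu)}{\bar F_I(b)}, \quad k \ge 1, \tag{24} \]

the family of unbiased estimators $(Z(b) : b > 0)$ achieves asymptotically vanishing relative error
for the computation of $P\{\tau_b < \infty\}$, as $b \nearrow \infty$; that is:

\[ \varlimsup_{b \to \infty} \frac{\mathrm{Var}^Q[Z(b)]}{P\{\tau_b < \infty\}^2} = 0. \]

Theorem 4. If $\bar F(\cdot)$ is regularly varying with index $\alpha > 2$, for the choice of $p = (p_k : k \ge 1)$ in
(24):

\[ E^Q[\nu_b] \le \frac{r + o(1)}{\mu(\alpha - 2)}\, b, \quad \text{as } b \nearrow \infty. \]
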

Proofs of both these results are given later in Section 5. 
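For a concrete feel for (24) and for the bound in Theorem 4, the following sketch evaluates the probabilities $p_k$ and the quantity $\sum_k p_k n_k$, which bounds the expected number of increments per run, for a Pareto-type tail $\bar F(x) = (1+x)^{-\alpha}$. The tail, the parameter values and the function names are assumptions of this illustration.

```python
def integrated_tail(x, alpha):
    """Integral of (1 + u)**(-alpha) over [x, infinity), for x >= 0 and alpha > 1."""
    return (1.0 + x) ** (1.0 - alpha) / (alpha - 1.0)

def block_pmf(b, mu, r, alpha, k_max):
    """p_k as in (24) for k = 1..k_max, with n_0 = 0 and n_k = r**k."""
    ends = [0] + [r ** k for k in range(1, k_max + 1)]
    base = integrated_tail(b, alpha)
    return [(integrated_tail(b + ends[k - 1] * mu, alpha)
             - integrated_tail(b + ends[k] * mu, alpha)) / base
            for k in range(1, k_max + 1)]

b, mu, r, alpha, k_max = 1000.0, 1.0, 2, 2.5, 200
p = block_pmf(b, mu, r, alpha, k_max)
total = sum(p)                                                 # telescopes to nearly 1
mean_block_end = sum(pk * r ** k for k, pk in enumerate(p, start=1))
leading_term = r * b / (mu * (alpha - 2.0))                    # leading term in Theorem 4
print(total, mean_block_end, leading_term)
```

The sum of the $p_k$ telescopes to one, and $\sum_k p_k n_k$ stays below the leading term $rb/(\mu(\alpha-2))$ in this example, in line with the linear-in-$b$ growth stated in Theorem 4.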

Remark 4. From Theorem 3, we have the vanishing relative error property for computing
$P\{\tau_b < \infty\}$ whenever the increment random variables $X_n$ have finite mean (irrespective of the variance).
Therefore we require only $o(\epsilon^{-2}\delta^{-1})$ i.i.d. replications of $Z(b)$ to arrive at estimators that differ
relatively at most by $\epsilon$ with probability at least $1 - \delta$. Now from Theorem 4 we conclude that,
if the tail index $\alpha > 2$ (in which case the increments have finite variance), our importance
sampling methodology estimates $P\{\tau_b < \infty\}$ in $O(b)$ expected computational effort.

Remark 5. From the conditional limit result in (9), one can infer that the values $p_k$ as in (24)
match the zero-variance probability $P\{n_{k-1} < \tau_b \le n_k \mid \tau_b < \infty\}$ asymptotically. For tails $\bar F(\cdot)$
with regularly varying index $1 < \alpha < 2$, we have that $E[\tau_b \mid \tau_b < \infty] = \infty$; that is, the
zero-variance measure itself has infinite expected termination time! Since the $p_k$ are assigned a value
similar to $P\{n_{k-1} < \tau_b \le n_k \mid \tau_b < \infty\}$, one might suspect infinite expected termination time for
a single run of Algorithm 1 as well. As we note later in Remark 9 after the proof of Theorem 4, for
$p_k$'s as in (24), this is indeed the case.

4.3. Simulation of $\{\tau_b < \infty\}$ - the infinite variance case. As noted in Remark 5, infinite
expected termination time for a simulation algorithm is clearly unacceptable. The following question then
is natural: By choosing the $p_k$'s differently, even if it means compromising on estimator variance, can
one achieve finite expected termination time for the procedure in Section 4.2? Before answering
this question below, we introduce a family of tail distributions and their integrated counterparts:
for any $\beta > 2$, define

\[ G^{(\beta)}(x) := \frac{\bar F(x)}{x^{\beta - \alpha}}, \quad \text{and} \quad G_I^{(\beta)}(x) := \int_x^\infty G^{(\beta)}(u)\,du. \tag{25} \]


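Theorem 5. If the tail $\bar F(\cdot)$ is regularly varying with index $\alpha \in (1.5, 2]$, then for any
$\beta \in (2, 2\alpha - 1)$,

\[ p_k = \frac{G_I^{(\beta)}(b + n_{k-1}\mu) - G_I^{(\beta)}(b + n_k\mu)}{G_I^{(\beta)}(b)}, \quad k \ge 1 \tag{26} \]

yields a family of unbiased estimators $(Z(b) = Z_K(b)/p_K : b > 0)$ achieving

1. strong efficiency: $\varlimsup_{b \to \infty} E^Q[Z^2(b)] / P\{\tau_b < \infty\}^2 < \infty$, and

2. finite expected termination time: $E^Q[\nu_b] \le \frac{r + o(1)}{\mu(\beta - 2)}\, b$, as $b \nearrow \infty$.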

Remark 6. Because of the strong efficiency, we need just $O(\epsilon^{-2}\delta^{-1})$ i.i.d. replications of $Z(b)$ to
achieve the desired relative precision. As in Remark 4, due to the bound on $E^Q[\nu_b]$ in Theorem
5, the average computational effort for the entire estimation procedure is just $O(\epsilon^{-2}\delta^{-1} b)$. It is
important to see this achievement in the context of Remark 5: the induced measure $Q(\cdot)$ deviates
from the zero-variance measure such that we get finite expected termination time, but only at
the cost of weakening the vanishing relative error property to strong efficiency. Thus for the selection of
$p_k$'s as in (26), the suggested procedure ends up offering a vastly superior performance (in terms
of computational complexity) compared to the zero-variance change of measure.
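The contrast between the two choices of $p_k$ can be seen numerically. The sketch below assumes a pure power tail $\bar F(x) = x^{-\alpha}$ for $x \ge 1$ and takes $G^{(\beta)}(x)$ proportional to $x^{-\beta}$, so that both (24) and (26) reduce to the same formula with a different exponent; these modelling choices and the function names are assumptions made for illustration. It compares partial sums of $\sum_k p_k n_k$, an upper bound on the expected number of increments per run, as more blocks are included.

```python
def block_pmf(b, mu, r, index, k_max):
    """p_k built from a power tail x**(-index): index = alpha mimics (24),
    index = beta mimics (26). Constants in the integrated tail cancel."""
    ends = [0] + [r ** k for k in range(1, k_max + 1)]

    def integrated(x):
        return x ** (1.0 - index)

    base = integrated(b)
    return [(integrated(b + ends[k - 1] * mu) - integrated(b + ends[k] * mu)) / base
            for k in range(1, k_max + 1)]

def partial_mean(b, mu, r, index, k_max):
    """Partial sum of sum_k p_k * n_k over the first k_max blocks."""
    p = block_pmf(b, mu, r, index, k_max)
    return sum(pk * r ** k for k, pk in enumerate(p, start=1))

b, mu, r, alpha, beta = 100.0, 1.0, 2, 1.8, 2.2     # beta lies in (2, 2*alpha - 1)
with_24 = {k: partial_mean(b, mu, r, alpha, k) for k in (100, 200)}
with_26 = {k: partial_mean(b, mu, r, beta, k) for k in (100, 200)}
print(with_24)   # keeps growing as more blocks are added
print(with_26)   # settles to a finite value
```

With the exponent $\alpha < 2$ the partial sums grow without bound, whereas with $\beta > 2$ they converge and remain below $rb/(\mu(\beta-2))$ in this example.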

Given this result, it is difficult not to wonder why the tail index $\alpha$ should be larger than 1.5
in the statement of Theorem 5, and what happens when $\alpha \le 1.5$. The following result shows
that it is indeed not possible to have both strong efficiency and finite expected termination time
when the tail index $\alpha < 1.5$.

Theorem 6. If the tail index $\alpha < 1.5$, there does not exist an assignment of $(p_k, n_k : k \ge 1)$
such that both $E^Q[Z^2(b)]$ and $E^Q[\nu_b]$ are simultaneously finite.

Remark 7. If the tail index $\alpha = 1.5$, the possibility of having both $E^Q[Z^2(b)]$ and $E^Q[\nu_b]$ finite
will depend on the slowly varying function $L(\cdot)$. As we see in the proof of Theorem 6,

\[ E^Q[Z^2(b)]\, E^Q[\nu_b] = \Omega\left( \left( \int_{2b}^{\infty} \sqrt{u}\, \bar F(u)\,du \right)^2 \right), \]

as $b \nearrow \infty$. If $L(x) = O((\log x)^{-m})$, $m > 2$, the above integral is finite, whereas if $L(x) = O(\log x)$
it is infinite; and it is easily verified that the case of $L(x) = O((\log x)^{-m})$, $m > 2$, goes through
the proof of Theorem 5, thus achieving both strong efficiency and finite expected termination
time. This illustrates the subtle dependence on the associated slowly varying function $L(\cdot)$ for
the existence of such $p_k$'s and $n_k$'s.

As illustrated by Theorem 7 below, for $\alpha \in (1, 1.5]$, we still have algorithms that demand only
$O(b)$ units of expected computer time if we look for less stringent notions of efficiency.

Theorem 7. If the tail $\bar F(\cdot)$ is regularly varying with index $\alpha \in (1, 1.5]$, then there exists an
explicit selection of $p = (p_k : k \ge 1)$ such that the family of unbiased estimators $(Z(b) : b > 0)$
satisfies both:

\[ \varlimsup_{b \to \infty} \frac{E^Q[Z^{1+\gamma}(b)]}{P\{\tau_b < \infty\}^{1+\gamma}} < \infty \quad \text{for all } \gamma \in \left(0, \frac{\alpha - 1}{2 - \alpha}\right), \text{ and} \tag{27} \]

\[ E^Q[\nu_b] \le Cb \quad \text{for some constant } C. \]

In particular, for the following selection of $p = (p_k : k \ge 1)$,

\[ p_k = \frac{G_I^{(\beta)}(b + n_{k-1}\mu) - G_I^{(\beta)}(b + n_k\mu)}{G_I^{(\beta)}(b)}, \quad k \ge 1 \tag{28} \]

if $\beta$ is chosen in $(2, \alpha + \gamma^{-1}(\alpha - 1))$, both the above inequalities are satisfied.

Remark 8. If the estimator Z(b) satisfies (27), similar to how we arrived at (1), it can be 
shown that 0(e~( 1+7 )<5~ 7 ) i.i.d. replications of Z(b) are enough to produce estimates having 
relative deviation at most e with probability at least 1 — 5. Now according to Theorem 7, the 
expected termination time in each replication is 0(b). Thus with the p^s chosen as in (28), we 
expend just 0(e - ( 1+7 )<5 -7 b) units of computer time on an average, which is still linear in b. 
The price we pay by not adhering to strong efficiency is the worse dependence on the parameters 
e and 5. 

It is further interesting to note that a vastly different state-dependent methodology developed
using Lyapunov inequalities in Blanchet and Liu (2012) also hits identical barriers and
provides results similar to ours: They present algorithms that are both strongly efficient and
possess $O(b)$ expected termination time for the case of tails having index $\alpha > 1.5$; whereas when
$\alpha \in (1, 1.5]$, they provide estimators satisfying (27) along with $O(b)$ expected termination time
of a simulation run.



5. Proofs of key theorems 



5.1. Proof of Theorem 3. Recall that the overall estimator $Z(b) = Z_K(b)/p_K$, where $p_k$ is
as in (24). The second moment of the estimator $Z(b)$ is bounded as below:

\[ E^Q[Z^2(b)] = E^Q\left[ \frac{Z_K^2(b)}{p_K^2} \right] = E^Q\left[ E^Q\left[ \frac{Z_K^2(b)}{p_K^2} \,\Big|\, K \right] \right] = E^Q\left[ \frac{E^Q[Z_K^2(b) \mid K]}{P\{n_{K-1} < \tau_b \le n_K\}^2} \cdot \frac{P\{n_{K-1} < \tau_b \le n_K\}^2}{p_K^2} \right]. \]

Given $\epsilon > 0$ and large enough $b$, (8) and (24) give us:

\[ \frac{P\{n_{K-1} < \tau_b \le n_K\}}{p_K} \le \frac{1 + \epsilon}{\mu}\, \bar F_I(b). \tag{29} \]

Also from Theorem 2, we have:

\[ \frac{E^Q[Z_K^2(b) \mid K]}{P\{n_{K-1} < \tau_b \le n_K\}^2} \le 1 + \epsilon, \]

for values of $b$ sufficiently large. Then from (29) and (8),

\[ E^Q[Z^2(b)] \le \frac{(1 + \epsilon)^3}{\mu^2}\, \bar F_I^2(b) \le (1 + \epsilon)^4\, P\{\tau_b < \infty\}^2, \]

thus proving the asymptotically vanishing relative error property. $\Box$

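5.2. Proof of Theorem 4. Recall that $\nu_b$ denotes the maximum of indices of the increment
random variables ($X_j$'s) considered for simulation in a particular simulation run. From the
sampling procedures in Section 4.1, it is clear that $\nu_b \le n_K$. Therefore,

\[ E^Q[\nu_b] \le E^Q[n_K] = \sum_{k \ge 1} p_k n_k = r p_1 + \sum_{k \ge 2} r^k p_k = \frac{1}{\bar F_I(b)} \left( r \int_b^{b + r\mu} \bar F(u)\,du + \sum_{k \ge 1} r^{k+1} \int_{b + r^k\mu}^{b + r^{k+1}\mu} \bar F(u)\,du \right). \tag{30} \]

Since

\[ r^k \int_{b + r^k\mu}^{b + r^{k+1}\mu} \bar F(u)\,du = \frac{b + r^k\mu - b}{\mu} \int_{b + r^k\mu}^{b + r^{k+1}\mu} \bar F(u)\,du \le \frac{1}{\mu} \left( \int_{b + r^k\mu}^{b + r^{k+1}\mu} u \bar F(u)\,du - b \int_{b + r^k\mu}^{b + r^{k+1}\mu} \bar F(u)\,du \right), \]

we have

\[ \sum_{k \ge 1} r^{k+1} \int_{b + r^k\mu}^{b + r^{k+1}\mu} \bar F(u)\,du \le \frac{r}{\mu} \left( \int_{b + r\mu}^{\infty} u \bar F(u)\,du - b \int_{b + r\mu}^{\infty} \bar F(u)\,du \right) \tag{31} \]

\[ \le \frac{r + o(1)}{\mu} \left( \frac{(b + r\mu)^2}{\alpha - 2} - \frac{b\,(b + r\mu)}{\alpha - 1} \right) \bar F(b + r\mu), \quad \text{as } b \nearrow \infty \]

\[ = \frac{r + o(1)}{\mu(\alpha - 1)(\alpha - 2)}\, b^2 \bar F(b), \]

where the penultimate step follows from Karamata's theorem (see (10)), and the final step just
uses the long-tailed nature of $\bar F(\cdot)$. Also note that $\int_b^{b + r\mu} \bar F(u)\,du \le r\mu \bar F(b)$, and by application of
Karamata's theorem, we have $\bar F_I(b) \sim b \bar F(b)/(\alpha - 1)$, as $b \nearrow \infty$. Therefore from (30),

\[ E^Q[\nu_b] \le \frac{r + o(1)}{\mu(\alpha - 2)}\, b, \quad \text{as } b \nearrow \infty, \]

thus yielding the required bound on the expected termination time. $\Box$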

Remark 9. Similar to how we arrived at (31), lower bounds can be obtained to show that
$E^Q[\nu_b] = \Omega\left( \int_b^\infty u \bar F(u)\,du \right)$. If the tail index $\alpha < 2$, $\int_b^\infty u \bar F(u)\,du$ turns out to be infinite, and
subsequently $E^Q[\nu_b] = \infty$. Though the assignment of $p_k$'s in (24) yields vanishing relative error
for any $\alpha > 1$, it fails to provide algorithms which have finite expected termination time when
the increment random variables $X$ have infinite variance (e.g., when $\alpha < 2$), thus making this
choice of $p_k$ not suitable for practice.



5.3. Proof of Theorem 5. 

1. Variance of $Z(b)$: Since $Q(K = k) = p_k$,

\[ E^Q[Z^2(b)] = E^Q\left[ \frac{Z_K^2(b)}{p_K^2} \right] = \sum_k \frac{E^Q[Z_k^2(b)]}{p_k^2}\, p_k = \sum_k \frac{E^Q[Z_k^2(b)]}{P\{n_{k-1} < \tau_b \le n_k\}^2} \cdot \frac{P\{n_{k-1} < \tau_b \le n_k\}^2}{p_k}. \]

Thanks to the uniformly efficient estimators developed in Section 4.1, Theorem 2 helps us
to write:

\[ E^Q[Z^2(b)] \le (1 + \epsilon) \sum_k \frac{P\{n_{k-1} < \tau_b \le n_k\}^2}{p_k}, \tag{32} \]

for large values of $b$. Due to the uniform convergence in (8), and the assignment of $p_k$'s as
in (26), we can write,



[n k -i <n< n k } 
Pk 



uniformly in k. Note that, 



^ h , v F I (b + n k - ll i)-F I {b + n k n) MP),,, 

S U + ej . IToT -Lr/ (,0), 



Fi{b + 77,jfc_i/i) - F/(o + n fc /i) = / F(u)du < (n k - n k _i)nF(b + n k _in), 



rb+n k n 



G i f ) (b + n k _ 1 fi)-G { f\b + n k fi)= f + ^ (u)du > (n k - n fc _i)/iG^ (6 + n fc /i), and 

G^Cft + nfc-i/i) G^)(6 + n fc ^) ^ 
GM(Hn^) -^(Kft + ^/x))- 1 +£Jr ' 

due to the regularly varying nature of Therefore for values of b sufficiently large, 

P{n fc _i < n < n k } F(b + n fc _!/i) G^\b + n k n) MP) (h \ 

Pk - ii + £J G(«(Hn tf )G(fl(H^) 7 [ ) 

= (l + e) 2 rP(b + n k - W f- a Gf\b), (33) 



for all k, because = ^ 01 • Then from (32) 



E« [Z 2 (6)] < (1 + ejVcf (6) J> + n^^P-K^ < r 6 < n fc } 

fe 

< (1 + efr^Gf \b) J> + n fe _ lM )^ Q / F(«)du, 

fe Jb+n k _ 1 fj, 
because of (8). Consequently, 

rb+n k n 



E Q [Z\b)} < (1 + efr p Gf > (b) ^ / u^ a F{u)du 

k Jb+n k _ 1 [i 

POO 

<{l + efr^Gf\b) u^ a F(u)du 



<(l + e)^Gf\b)b^ 2a F _^_ v 

for large enough values of $b$ due to Karamata's theorem if $2\alpha - \beta > 1$. This is indeed
true because $\beta$ is assumed smaller than $2\alpha - 1$ in the statement of Theorem 5. Further
{a - l)F/(6) ~ bF(b) and b^- a Gf\b) ~ f>(6), as & / oo. Therefore, 

— E«[Z 2 (6)] (a-iy 

lim — i 9/ \ JS < -i — < oo. 

fe^oo F 2 (6) ~ 2a -0-1 

Now since $P\{\tau_b < \infty\} \sim \mu^{-1} \bar F_I(b)$, we have strong efficiency.

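2. Expected termination time: Since $\nu_b \le n_K$, $E^Q[\nu_b] \le E^Q[n_K] = \sum_k p_k n_k$. For the choice of
$p_k$ in (26), following exactly the same steps in the proof of Theorem 4, we arrive at:

\[ E^Q[\nu_b] \le \frac{1}{G_I^{(\beta)}(b)} \left( r \int_b^{b + r\mu} G^{(\beta)}(u)\,du + \frac{r}{\mu} \left( \int_{b + r\mu}^{\infty} u\, G^{(\beta)}(u)\,du - b \int_{b + r\mu}^{\infty} G^{(\beta)}(u)\,du \right) \right). \]

Since $G^{(\beta)}(\cdot)$ is regularly varying with tail index larger than 2, by application of Karamata's
theorem, we have:

\[ \int_{b + r\mu}^{\infty} u\, G^{(\beta)}(u)\,du \sim \frac{(b + r\mu)^2}{\beta - 2}\, G^{(\beta)}(b + r\mu), \]

which would not have been the case if we had persisted with using $\bar F_I(\cdot)$ instead of $G_I^{(\beta)}(\cdot)$
for $p_k$. Again following the remaining steps in the proof of Theorem 4, we conclude that:

\[ E^Q[\nu_b] \le \frac{r + o(1)}{\mu(\beta - 2)}\, b, \quad \text{as } b \nearrow \infty, \]

thus yielding finite termination time even when the zero-variance measure fails to offer
this desirable property. $\Box$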

5.4. Proof of Theorem 6. Since $Q(K = k) = p_k$, see that:

\[ E^Q[Z^2(b)] = E^Q\left[ \frac{Z_K^2(b)}{p_K^2} \right] = \sum_k \frac{E^Q[Z_k^2(b)]}{p_k} \ge \sum_k \frac{P\{n_{k-1} < \tau_b \le n_k\}^2}{p_k}, \]

because of Jensen's inequality. To arrive at a contradiction, let us assume that both $E^Q[Z^2(b)]$
and $E^Q[\nu_b]$ are finite. Then,



> 



£ ^ 

v k J 



(34)
where the penultimate step follows from the Cauchy-Schwarz inequality. Due to the uniform
convergence in (8), given $\epsilon > 0$, for $b$ large enough:



22 Vn^Pink-! <n< n k } > (1 - e ) ^T k \ F(b + u)d 
k k J n k -i» 

> 22 / y/uF(b + u)du 



1 - e 



oo 

y/uF(b + u)du. 



VJi Jo 

Now it can be seen easily that the RHS is finite only when $\alpha > 1.5$, via the following change of
variable and the subsequent integration of the resulting regularly varying tail:

\[ \int_0^\infty \sqrt{u}\, \bar F(b + u)\,du = \int_b^\infty \sqrt{u - b}\, \bar F(u)\,du \ge \int_{2b}^\infty \sqrt{u}\, \sqrt{1 - \frac{b}{u}}\, \bar F(u)\,du \ge \frac{1}{\sqrt{2}} \int_{2b}^\infty \sqrt{u}\, \bar F(u)\,du, \]

which cannot be finite if $\alpha < 1.5$, thus arriving at the desired contradiction. Therefore from
(34), we conclude that we cannot have both the second moment of $Z(b)$ and the expected
termination time $E^Q[\nu_b]$ simultaneously finite if the tail index $\alpha < 1.5$. $\Box$
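The Cauchy-Schwarz step in the proof above can be checked numerically on a toy instance. In the sketch below, `q` holds made-up block probabilities and `n` the block ends (both hypothetical); for any probability vector $p$, the product of $\sum_k q_k^2/p_k$ and $\sum_k p_k n_k$ is at least $(\sum_k \sqrt{n_k}\, q_k)^2$, with equality when $p_k$ is proportional to $q_k/\sqrt{n_k}$.

```python
import math
import random

def second_moment_floor(q, p):
    """sum_k q_k**2 / p_k, the Jensen lower bound on the second moment."""
    return sum(qk * qk / pk for qk, pk in zip(q, p))

def mean_block_end(n, p):
    """sum_k p_k * n_k."""
    return sum(nk * pk for nk, pk in zip(n, p))

def tradeoff_product(q, n, p):
    return second_moment_floor(q, p) * mean_block_end(n, p)

n = [2 ** k for k in range(1, 11)]                   # block ends n_k = 2**k
q = [nk ** -1.4 for nk in n]                         # toy block probabilities (hypothetical)
floor = sum(math.sqrt(nk) * qk for nk, qk in zip(n, q)) ** 2

weights = [qk / math.sqrt(nk) for nk, qk in zip(n, q)]
p_star = [w / sum(weights) for w in weights]         # attains equality in Cauchy-Schwarz

rng = random.Random(1)
products = []
for _ in range(100):
    raw = [rng.random() + 1e-3 for _ in n]
    p = [x / sum(raw) for x in raw]
    products.append(tradeoff_product(q, n, p))

print(floor, tradeoff_product(q, n, p_star), min(products))
```

No choice of $p$ pushes the product below the floor, which is why the finiteness of $\sum_k \sqrt{n_k}\, P\{n_{k-1} < \tau_b \le n_k\}$ decides whether both quantities can be finite at once.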

5.5. Proof of Theorem 7. The proof is similar to that of Theorem 5, and we provide only
an outline of the steps involved. Since $Q(K = k) = p_k$, as in (32),

E«[Z^(6)] < (1 + e) 1+ ^( P{refc - 1<Tfc - nfc} ) 7 F{n, _ x < r b < n k} , 

for sufficiently large values of b. Then using (33) and (8), 

E«[Z 1+ T(6)] < (1 + 6) 1+3 T (pf \b)Y ^2 ( b + n fc _i/i) 7(/3 - a) P{n fc _! < r b < n k } 

k 

oo 



< (l + e) 2+3 T (Gf } (6)) / u<^-^F{u)du, 



which follows from the routine calculation in the proof of Theorem 5. Since $\beta$ is smaller than
$\alpha + \gamma^{-1}(\alpha - 1)$ as in the statement of Theorem 7, the tail index of the integrand,
$\alpha - \gamma(\beta - \alpha)$, is larger than 1.
Therefore we can apply Karamata's theorem to conclude that for values of $b$ large enough,

/ -ia\ \ 7 Kl{P~ a )+ l 

E Q [Z l+ i(b)] < (1 + e) 3+37 [Gf \b) ) r -F(b). 



a — 7(/3 — a) — 1 

Now observing that (a - l)F/(6) ~ bF(b),b p - a Gf ] \b) ~ JPj(6), and P{r 6 < oo} ~ ^F^b) as 
6 /* oo, we have: 

— E C ?[Z 1 +T(6)] M 2 (a-1) 
b~,oo ¥{r h < oo} 1+ t ~ a - 7(/3 - a) - 1 

Since $\beta$ is ensured to be larger than 2, the same proof for $E^Q[\nu_b] = O(b)$ goes through. $\Box$

6. Numerical Experiments

In this section, we present the results of numerical simulation experiments performed on examples
previously considered in the literature, and compare the performance of our algorithms.

6.1. Example 1 - estimation of $P\{S_n > b\}$. Take $X = AR$, where $P\{A > x\} = 1 \wedge x^{-4}$,
$R \sim \mathrm{Laplace}(1)$, and $A$ is independent of $R$. We use $N = 10{,}000$ simulation runs to
estimate $P\{S_n > n\}$ for $n = 100$, 500 and 1000. In Table 1, we compare the numerical estimates
obtained by our simulation procedure with the true values of $P\{S_n > n\}$ evaluated in Blanchet
and Liu (2008) via inverse transform techniques; further, a comparison of performance of our
methodology with Algorithms 1 and 2 in Blanchet and Liu (2008) (referred to as BL1 and
BL2) has also been presented. From the columns CV, CV of BL1, and CV of BL2, it can be
inferred that our state-independent simulation procedures yield estimators with substantially
lower coefficient of variation throughout the range of values considered.



   n    P{S_n > n}      Estimate (Z)      Std. error       CV of Z    CV of BL1    CV of BL2
                        for P{S_n > n}
 100    2.21 × 10^-5    2.17 × 10^-5      4.31 × 10^-7     1.97       10.3         4.7
 500    1.04 × 10^-7    1.05 × 10^-7      6.91 × 10^-10    0.66        1.0         4.1
1000    1.25 × 10^-8    1.29 × 10^-8      6.91 × 10^-11    0.53        1.1         3.8

Table 1: Numerical result for Example 1 - here Std. error denotes the standard deviation of the
estimator of $P\{S_n > n\}$ based on 10,000 simulation runs; CV denotes the empirically observed
coefficient of variation



6.2. Example 2 - estimation of $P\{\tau_b < \infty\}$. To facilitate comparison with existing methods,
we use the following example from Blanchet and Glynn (2008): Consider an M/G/1 queue with
traffic intensity $\rho = 0.5$ and Pareto service times having tail $P\{V > t\} = (1 + t)^{-2.5}$. The
aim is to estimate the probability that this queue develops a waiting time $b$ in stationarity by
equivalently estimating the level crossing probabilities $P\{\tau_b < \infty\}$ of the associated negative
drift random walk. For this example, we use the simulation procedures discussed in Section 4
and compare the results with those of the existing algorithms in the literature in Table 2. While
Algorithms AK (in Asmussen and Kroese (2006)) and DLW (in Dupuis et al. (2007)) restrict the
arrivals to be Poisson, the schemes BGL, BG and BL referring to the algorithms, respectively, in
Blanchet et al. (2007), Blanchet and Glynn (2008) and Blanchet and Liu (2012) do not impose
any such restriction.

In our implementation, $r$ has been chosen to be 2 to keep the expected termination time
low, as suggested by Theorem 4. The results reported in Table 2 correspond to the simulation
estimates of $P\{\tau_b < \infty\}$ for values of $b = 10^2$, $10^3$ and $10^4$ using $N = 10{,}000$ simulation runs.
From Table 2, it can be inferred that the error offered by the estimates of our simpler
state-independent procedure is much smaller when compared with other existing algorithms. Table 3
gives a comparison of the coefficient of variation of the estimators empirically observed for different
values of $r$, and a fixed $b = 10^3$. It can be seen from Table 3 as well that choosing $r = 2$ helps
in keeping the relative error low.

7. Conclusion 

In this paper we revisited the problem of efficient simulation of commonly encountered rare event
probabilities associated with random walks having regularly varying heavy-tailed increments.

Method              Quantity      b = 10^2         b = 10^3         b = 10^4
Proposed methods    Estimation    9.75 × 10^-4     3.15 × 10^-5     9.98 × 10^-7
                    Std. error    4.11 × 10^-6     7.89 × 10^-8     1.39 × 10^-9
                    CV            0.42             0.25             0.14
AK                  Estimation    1.2? × 10^-3     3.15 × 10^-5     9.98 × 10^-7
                    Std. error    1.48 × 10^-5     2.19 × 10^-7     6.95 × 10^-9
                    CV            1.??             0.70             0.70
DLW                 Estimation    1.?? × 10^-3     3.16 × 10^-5     9.91 × 10^-7
                    Std. error    ?.?? × 10^-6     1.69 × 10^-7     2.99 × 10^-9
                    CV            0.??             0.53             0.30
BGL                 Estimation    1.02 × 10^-3     3.17 × 10^-5     1.13 × 10^-6
                    Std. error    3.84 × 10^-5     1.60 × 10^-6     7.28 × 10^-8
                    CV            3.76             5.05             6.44
BG                  Estimation    1.08 × 10^-3     3.15 × 10^-5     9.98 × 10^-7
                    Std. error    5.97 × 10^-6     9.73 × 10^-8     2.07 × 10^-9
                    CV            0.55             0.31             0.21
BL                  Estimation    1.05 × 10^-3     3.18 × 10^-5     9.88 × 10^-7
                    Std. error    3.76 × 10^-5     2.60 × 10^-7     8.19 × 10^-9
                    CV            3.58             0.82             0.83

(A "?" marks a digit that is not legible in the source.)

Table 2: Numerical result for Example 2 - here Std. error denotes the standard deviation of the
estimator of $P\{\tau_b < \infty\}$ based on 10,000 simulation runs; CV denotes the empirically observed
coefficient of variation



  r    Estimate        Std. error      CV
  2    3.15 × 10^-5    7.89 × 10^-8    0.25
 10    3.16 × 10^-5    1.03 × 10^-7    0.33
100    3.16 × 10^-5    1.55 × 10^-7    0.49



Table 3: Comparison of relative errors for different choices of $r$ in Example 2 with $b = 1000$;
here Std. error denotes the standard deviation of the estimator of $P\{\tau_b < \infty\}$ based on 10,000
simulation runs; CV denotes the empirically observed coefficient of variation
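The columns of Table 3 can be cross-checked against one another. The sketch below assumes that the reported standard error is that of the sample mean over $N = 10{,}000$ runs, so that the per-replication coefficient of variation is the standard error times $\sqrt{N}$ divided by the estimate; this reading of the columns and the function name are assumptions, and the numbers are those read from Table 3.

```python
import math

def coefficient_of_variation(estimate, std_error, n_runs):
    """Per-replication CV implied by the standard error of a mean of n_runs runs."""
    return std_error * math.sqrt(n_runs) / estimate

# (r, estimate, std. error, reported CV) as read from Table 3; N = 10,000 runs.
rows = [
    (2, 3.15e-5, 7.89e-8, 0.25),
    (10, 3.16e-5, 1.03e-7, 0.33),
    (100, 3.16e-5, 1.55e-7, 0.49),
]
implied = [coefficient_of_variation(est, se, 10_000) for _, est, se, _ in rows]
for (r, _, _, reported), cv in zip(rows, implied):
    print(r, round(cv, 3), reported)
```

Under this reading the implied values agree with the reported CV column to within rounding.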

These comprised the large deviations probability of a random walk exceeding large values as well
as the level crossing probability of a negative-drift random walk. In the existing literature there
are results that suggest that state-independent methods for such probabilities are difficult to
design. Significant research over the last few years has resulted in sophisticated state-dependent
importance sampling techniques for estimating these probabilities. Our key contribution has
been to challenge this view by showing that simple state-independent importance sampling
methods that are at least as efficient as the existing state-dependent methods can indeed be
devised to estimate these probabilities.

Our approach relied on partitioning the rare event of interest into elementary events that
were amenable to straightforward state-independent importance sampling methods. We expect
that this approach will generalize to more complex, multi-dimensional problems, and to similar
problems involving Weibull-type sub-exponential tail distributions.

Appendix

Here we present proofs of Lemmas 1, 2, 4 and 5. To prove Lemmas 1 and 4, we need Lemmas 8
and 9, which are stated and proved below. The proof of Lemma 8 follows the lines of Borovkov
and Borovkov (2008), where bounds for similar integrals have been derived.

Lemma 8. For any pair of sequences {x n }, {4> n } satisfying x n /*■ oo and (f) n x n /* oo, the 
integral, 



f 

j — < 



e^ x F(dx) < 1 + ate + e 2a F ( — ) + e^ Xn F(x n )(l + oil)), asn/oo, 

\4>nJ 



for any < k < a A 2, and some constant c which does not depend on n and b. 

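Proof. We split the region of integration into $(-\infty, \gamma/\phi_n]$ and $(\gamma/\phi_n, x_n]$ for some constant 
$\gamma > 0$; the partition is such that the integrand stays bounded on the former even as that region grows 
to $(-\infty, \infty)$. Let $I_1 := \int_{-\infty}^{\gamma/\phi_n} e^{\phi_n x} F(dx)$ and $I_2 := \int_{\gamma/\phi_n}^{x_n} e^{\phi_n x} F(dx)$. Then, 

\begin{align*}
I_1 &\le \int_{-\infty}^{\gamma/\phi_n} F(dx) + \phi_n \int_{-\infty}^{\gamma/\phi_n} x F(dx) + \phi_n^{\kappa} \int_{-\infty}^{\gamma/\phi_n} |x|^{\kappa} e^{\phi_n x} F(dx) \\
&\le \int_{-\infty}^{\infty} F(dx) + \phi_n \int_{-\infty}^{\infty} x F(dx) + \phi_n^{\kappa} e^{\gamma} \int_{-\infty}^{\infty} |x|^{\kappa} F(dx) \\
&= 1 + c\phi_n^{\kappa}, \tag{35}
\end{align*}
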
where $c := e^{\gamma} \int_{-\infty}^{\infty} |x|^{\kappa} F(dx) < \infty$ because $E|X|^{\kappa} < \infty$; this follows because $\kappa < \alpha$ and from 
Assumption 1. We have also used $EX = 0$ to arrive at (35). Integrating by parts for the second 
integral $I_2$: 

\begin{align*}
I_2 &= -\int_{\gamma/\phi_n}^{x_n} e^{\phi_n x}\, \bar{F}(dx) \\
&= e^{\gamma}\bar{F}(\gamma/\phi_n) - e^{\phi_n x_n}\bar{F}(x_n) + \phi_n \int_{\gamma/\phi_n}^{x_n} e^{\phi_n x}\bar{F}(x)\,dx \\
&\le e^{\gamma}\bar{F}(\gamma/\phi_n) + I_2', \tag{36}
\end{align*}

where $I_2' := \phi_n \int_{\gamma/\phi_n}^{x_n} e^{\phi_n x}\bar{F}(x)\,dx$. Now the change of variable $u = \phi_n(x_n - x)$ results in: 



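\begin{align*}
I_2' &= e^{\phi_n x_n} \int_{0}^{\phi_n x_n - \gamma} e^{-u}\bar{F}\left(x_n - \frac{u}{\phi_n}\right) du \\
&= e^{\phi_n x_n}\bar{F}(x_n) \int_{0}^{\phi_n x_n - \gamma} e^{-u} g_n(u)\,du, \tag{37}
\end{align*}

where, 

\[
g_n(u) := \frac{\bar{F}\left(x_n - u/\phi_n\right)}{\bar{F}(x_n)} = \frac{\bar{F}\left(x_n\left(1 - u/(\phi_n x_n)\right)\right)}{\bar{F}(x_n)}.
\]
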
Since $L(\cdot)$ is slowly varying and $\phi_n x_n \to \infty$, given any $\delta > 0$, for all $n$ large enough we have: 

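This preliminary fact about slowly varying functions can be found in, e.g., Theorem 1.1.4 of 
Borovkov and Borovkov (2008). So for any fixed $u$, we have $g_n(u) \to 1$ as $n \nearrow \infty$. Now fix 
$\delta = \alpha/2$. Then for $n$ large enough, 

\[
g_n(u) \le \left(1 + \frac{\alpha}{2}\right)\left(1 - \frac{u}{\phi_n x_n}\right)^{-3\alpha/2}. \tag{38}
\]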



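Let $h(u) = (1 - u/(\phi_n x_n))^{-3\alpha/2}$. Since $\log h(0) = 0$ and $\frac{d}{du}\log h(u) \le \frac{3\alpha}{2\gamma}$ for $0 \le u \le \phi_n x_n - \gamma$, 
we have $h(u) \le e^{3\alpha u/(2\gamma)}$ on the same interval. Therefore if we choose $\gamma = 2\alpha$, the integrand in $I_2'$ 
is bounded for large enough $n$ by an integrable function as below: 

\[
e^{-u} g_n(u)\,\mathbf{1}\{0 \le u \le \phi_n x_n - \gamma\} \le \left(1 + \frac{\alpha}{2}\right) e^{-u} e^{\frac{3\alpha u}{2\gamma}} = \left(1 + \frac{\alpha}{2}\right) e^{-u/4}.
\]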
Applying the dominated convergence theorem, we get 

\[
\int_{0}^{\phi_n x_n - \gamma} e^{-u} g_n(u)\,du \sim 1 \quad \text{as } n \nearrow \infty.
\]

Since $\int_{-\infty}^{x_n} e^{\phi_n x} F(dx) = I_1 + I_2$, combining this result with (35), (36) and (37) completes the 
proof. □ 
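
The limit obtained by dominated convergence can be checked numerically. The sketch below is only an illustration under assumptions that are not in the text: it takes a pure Pareto tail ($L \equiv 1$, so that $g_n(u) = (1 - u/(\phi_n x_n))^{-\alpha}$), a hypothetical tail index $\alpha = 3$, and writes `m` for the product $\phi_n x_n$. It evaluates $\int_0^{m - 2\alpha} e^{-u} g_n(u)\,du$ with a midpoint rule for growing `m`.

```python
import math

def g(u, m, alpha):
    """g_n(u) for a pure Pareto tail (L identically 1), with m = phi_n * x_n (assumption)."""
    return (1.0 - u / m) ** (-alpha)

def dominated_integral(m, alpha=3.0, steps=200_000):
    """Midpoint-rule value of the integral of exp(-u) * g_n(u) over [0, m - 2*alpha]."""
    upper = m - 2.0 * alpha  # gamma = 2 * alpha, the choice made in the proof
    h = upper / steps
    total = 0.0
    for i in range(steps):
        u = (i + 0.5) * h
        total += math.exp(-u) * g(u, m, alpha)
    return total * h

ms = (50.0, 500.0, 5000.0)  # hypothetical values of phi_n * x_n
values = [dominated_integral(m) for m in ms]
for m, v in zip(ms, values):
    print(f"phi_n * x_n = {m:7.1f}   integral = {v:.6f}")
```

For this particular tail the computed values decrease towards 1 as `m` grows, which is the behaviour the proof relies on; other slowly varying factors $L$ would need their own check.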

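Lemma 9. Given any $\epsilon > 0$, uniformly for $b > n^{\beta+\epsilon}$, we have: 

(a) $n\theta_{n,b}^{\kappa} \searrow 0$, for some $1 < \kappa < \alpha$, and 

(b) $n\bar{F}(2\alpha/\theta_{n,b}) = o(1)$, as $n \nearrow \infty$. 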
Proof. (a) We have $\bar{F}(x) = x^{-\alpha}L(x)$. Since $L(\cdot)$ is slowly varying, given any $\delta > 0$, for sufficiently 
large values of $b$, we have $b^{-\delta} \le L(b) \le b^{\delta}$, thus yielding $L(b) = b^{o(1)}$ as $b \nearrow \infty$. Further 
noting that $b > n^{\beta+\epsilon}$ helps us to write: 

\[
n\theta_{n,b}^{\kappa} = \frac{n}{b^{\kappa}} \log^{\kappa}\left(\frac{1}{n\bar{F}(b)}\right) \le n^{1-\kappa(\beta+\epsilon)} \log^{\kappa}\left(\frac{b^{\alpha}}{nL(b)}\right).
\]



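If we take, 

\[
\kappa := \begin{cases} 2, & \text{if } \alpha > 2, \\ (1+\epsilon)/(\beta+\epsilon), & \text{if } 1 < \alpha \le 2, \end{cases} \tag{39}
\]

then $\kappa < \alpha$, and $\kappa(\beta+\epsilon) > 1 + \epsilon/2$. Then $n\theta_{n,b}^{\kappa} \searrow 0$ as $n \nearrow \infty$, uniformly for $b > n^{\beta+\epsilon}$. 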
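(b) We have $\theta_{n,b} := -\log\left(n\bar{F}(b)\right)/b$. Therefore, 

\[
n\bar{F}\left(\frac{2\alpha}{\theta_{n,b}}\right) = n\bar{F}(b)\, \frac{\bar{F}\left(\frac{2\alpha b}{-\log(n\bar{F}(b))}\right)}{\bar{F}(b)}.
\]

Since $\bar{F}(\cdot)$ is regularly varying, given any $\delta > 0$, for $n$ large enough, 

\[
\frac{\bar{F}\left(\frac{2\alpha b}{-\log(n\bar{F}(b))}\right)}{\bar{F}(b)} \le \left(\frac{-\log\left(n\bar{F}(b)\right)}{2\alpha}\right)^{\alpha+\delta}.
\]

The above inequality is just an application of Theorem 1.1.4 of Borovkov and Borovkov 
(2008). Therefore, 

\[
n\bar{F}\left(\frac{2\alpha}{\theta_{n,b}}\right) \le n\bar{F}(b)\left(\frac{-\log\left(n\bar{F}(b)\right)}{2\alpha}\right)^{\alpha+\delta} = o(1), \quad \text{uniformly for } b > n^{\beta+\epsilon} \text{ as } n \nearrow \infty.
\]

Here the convergence to 0 is justified because $\alpha > 2$ and $b > n^{\beta+\epsilon}$. □ 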
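Proof of Lemma 1. From the definition of $\Lambda_b(\cdot)$ and Lemma 8, we have: 

\begin{align*}
\exp\left(\Lambda_b(\theta_{n,b})\right) &= \int_{-\infty}^{b} \exp(\theta_{n,b}x) F(dx) \\
&\le 1 + c\theta_{n,b}^{\kappa} + e^{2\alpha}\bar{F}\left(\frac{2\alpha}{\theta_{n,b}}\right) + \exp(\theta_{n,b}b)\bar{F}(b)(1 + o(1)),
\end{align*}

for $\kappa$ as in (39). Usage of Lemma 8 is justified because $\theta_{n,b}b = -\log\left(n\bar{F}(b)\right) \nearrow \infty$. The last 
term, 

\[
\exp(\theta_{n,b}b)\bar{F}(b) = \frac{1}{n\bar{F}(b)}\bar{F}(b) = \frac{1}{n}.
\]

From Lemma 9, we have $n\theta_{n,b}^{\kappa} = o(1)$ and $\bar{F}(2\alpha/\theta_{n,b}) = o(1/n)$, uniformly for $b > n^{\beta+\epsilon}$. 
Therefore, 

\[
\exp\left(\Lambda_b(\theta_{n,b})\right) \le 1 + \frac{1}{n}(1 + o(1)), \quad \text{as } n \nearrow \infty.
\]

□ 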
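
The two facts used in this proof, $\exp(\theta_{n,b}b)\bar{F}(b) = 1/n$ and $n\theta_{n,b}^{\kappa} \to 0$, are easy to check numerically. The following sketch is an illustration only and rests on assumptions that are not part of the text: a pure Pareto tail $\bar{F}(x) = x^{-\alpha}$ with $\alpha = 3$, the choice $\kappa = 2$ from (39), and the boundary growth rate $b = n^{0.75}$ (that is, hypothetical values $\beta = 0.5$ and $\epsilon = 0.25$).

```python
import math

ALPHA = 3.0  # hypothetical tail index (assumption, not from the text)

def tail(x):
    """Hypothetical Pareto tail: F-bar(x) = x^(-ALPHA) for x >= 1."""
    return x ** (-ALPHA) if x >= 1.0 else 1.0

def theta(n, b):
    """theta_{n,b} = -log(n * F-bar(b)) / b."""
    return -math.log(n * tail(b)) / b

def last_term_times_n(n, b):
    """n * exp(theta_{n,b} * b) * F-bar(b); the proof shows this equals 1."""
    return n * math.exp(theta(n, b) * b) * tail(b)

def n_theta_kappa(n, b, kappa=2.0):
    """n * theta_{n,b}^kappa, the quantity in Lemma 9(a)."""
    return n * theta(n, b) ** kappa

rows = []
for n in (10**2, 10**4, 10**6, 10**8):
    b = n ** 0.75  # boundary case b = n^(beta + eps), with assumed beta = 0.5, eps = 0.25
    rows.append((n, last_term_times_n(n, b), n_theta_kappa(n, b)))

for n, unit, decay in rows:
    print(f"n = {n:>9d}   n*exp(theta*b)*Fbar(b) = {unit:.12f}   n*theta^2 = {decay:.4f}")
```

In this example the middle column stays at 1 up to rounding, while the last column decreases in `n`, in line with the $1 + \frac{1}{n}(1 + o(1))$ bound.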
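Proof of Lemma 2. It is enough to show that, given $\epsilon > 0$, there exists $b_\epsilon$ such that for all 
$b > b_\epsilon$, 

\[
\inf_{k \ge 1} \frac{P\{n_{k-1} < \tau_b \le n_k, A_k\}}{P\{n_{k-1} < \tau_b \le n_k\}} \ge 1 - \epsilon.
\]

Note that, 

\begin{align*}
P\{n_{k-1} < \tau_b \le n_k, A_k\} &= \sum_{j=n_{k-1}+1}^{n_k} P\{\tau_b = j, A_k\} \\
&\ge \sum_{j=n_{k-1}+1}^{n_k} P\{\tau_b = j,\ S_i > -(A + i\delta) \text{ for all } i < j,\ X_j > b + A + j(\mu + \delta)\},
\end{align*}

for some positive constants $A$ and $\delta$. Let $M_n := \max_{k \le n}(S_k - k\mu)$ and $M := \sup_k (S_k - k\mu)$. 
Then, 

\begin{align*}
P\{n_{k-1} < \tau_b \le n_k, A_k\} &\ge \sum_{j=n_{k-1}+1}^{n_k} P\{M_{j-1} \le b,\ S_i > -(A + i\delta) \text{ for all } i < j,\ X_j > b + A + j(\mu + \delta)\} \\
&\ge \sum_{j=n_{k-1}+1}^{n_k} P\{M_{j-1} \le b,\ S_i > -(A + i\delta) \text{ for all } i < j\}\, \bar{F}(b + A + j(\mu + \delta)) \\
&\ge P\{M \le b,\ S_i > -(A + i\delta) \text{ for all } i\} \sum_{j=n_{k-1}+1}^{n_k} \bar{F}(b + A + j(\mu + \delta)).
\end{align*}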

Since $P\{M > b\} = o(1)$, as $b \nearrow \infty$, by union bound, we have: 

\[
P\left(\{M > b\} \cup \{S_i < -(A + i\delta) \text{ for some } i\}\right) \le \epsilon + P\{S_i < -(A + i\delta) \text{ for some } i\}.
\]

Due to the law of large numbers, we can find $i_\epsilon$ such that for all $i > i_\epsilon$, $S_i$ is larger than $-(A + i\delta)$ 
with probability at least $1 - \epsilon$. Further for the collection $(S_i : i \le i_\epsilon)$, we can choose $A$ large 
enough such that for all $i \le i_\epsilon$, $S_i$ is larger than $-A$ with probability at least $1 - \epsilon$. Then, 

\[
P\left(\{M > b\} \cup \{S_i < -(A + i\delta) \text{ for some } i\}\right) \le 2\epsilon,
\]

and hence, 

\[
P\{n_{k-1} < \tau_b \le n_k, A_k\} \ge (1 - 2\epsilon) \sum_{j=n_{k-1}+1}^{n_k} \bar{F}(b + A + j(\mu + \delta)). \tag{40}
\]

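Now consider, 

\begin{align*}
\sum_{j=n_{k-1}+1}^{n_k} \bar{F}(b + A + j(\mu + \delta)) &\ge \sum_{j=n_{k-1}+1}^{n_k} \int_{j}^{j+1} \bar{F}(b + A + u(\mu + \delta))\,du \\
&\ge \int_{n_{k-1}+1}^{n_k} \bar{F}(b + A + u(\mu + \delta))\,du.
\end{align*}

After changing the variables of integration, we get: 

\begin{align*}
\sum_{j=n_{k-1}+1}^{n_k} \bar{F}(b + A + j(\mu + \delta)) &\ge \frac{1}{\mu + \delta} \int_{b+A+(n_{k-1}+1)(\mu+\delta)}^{b+A+n_k(\mu+\delta)} \bar{F}(u)\,du \\
&= \frac{\bar{F}_I\left(b + A + (n_{k-1} + 1)(\mu + \delta)\right) - \bar{F}_I\left(b + A + n_k(\mu + \delta)\right)}{\mu + \delta}.
\end{align*}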
Then from (40), 

\[
P\{n_{k-1} < \tau_b \le n_k, A_k\} \ge \frac{1 - 2\epsilon}{\mu + \delta}\left(\bar{F}_I\left(b + A + (n_{k-1} + 1)(\mu + \delta)\right) - \bar{F}_I\left(b + A + n_k(\mu + \delta)\right)\right).
\]

Since $\delta$ is arbitrary and $\bar{F}_I(\cdot)$ is long-tailed, for values of $b$ large enough, we have: 

\[
P\{n_{k-1} < \tau_b \le n_k, A_k\} \ge \frac{(1 - 2\epsilon)(1 - \epsilon)}{\mu}\left(\bar{F}_I\left(b + (n_{k-1} + 1)\mu\right) - \bar{F}_I\left(b + n_k\mu\right)\right).
\]

Now from (8), for all $k$, 

\[
P\{n_{k-1} < \tau_b \le n_k, A_k\} \ge (1 - 2\epsilon)(1 - \epsilon)^2\, P\{n_{k-1} < \tau_b \le n_k\},
\]

for large values of $b$, thus proving the claim. □ 
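
The sum-to-integral comparison used after (40) can be illustrated numerically. The sketch below assumes, purely for illustration, a Pareto tail $\bar{F}(x) = x^{-\alpha}$ for $x \ge 1$, whose integrated tail is $\bar{F}_I(x) = x^{1-\alpha}/(\alpha - 1)$; the parameter values and the function names are hypothetical and do not come from the text.

```python
def tail(x, alpha):
    """Hypothetical Pareto tail F-bar(x) = x^(-alpha) for x >= 1."""
    return x ** (-alpha) if x >= 1.0 else 1.0

def integrated_tail(x, alpha):
    """F-bar_I(x): integral of F-bar over [x, infinity), valid here for x >= 1."""
    return x ** (1.0 - alpha) / (alpha - 1.0)

def tail_sum(b, A, mu, delta, n_prev, n_cur, alpha):
    """Sum of F-bar(b + A + j*(mu + delta)) over j = n_prev + 1, ..., n_cur."""
    return sum(tail(b + A + j * (mu + delta), alpha)
               for j in range(n_prev + 1, n_cur + 1))

def integral_bound(b, A, mu, delta, n_prev, n_cur, alpha):
    """Lower bound (F-bar_I(lo) - F-bar_I(hi)) / (mu + delta) from the change of variables."""
    lo = b + A + (n_prev + 1) * (mu + delta)
    hi = b + A + n_cur * (mu + delta)
    return (integrated_tail(lo, alpha) - integrated_tail(hi, alpha)) / (mu + delta)

# Hypothetical parameter values chosen only to exercise the inequality.
params = dict(b=100.0, A=5.0, mu=1.0, delta=0.1, n_prev=10, n_cur=20, alpha=2.5)
lhs = tail_sum(**params)
rhs = integral_bound(**params)
print(f"sum = {lhs:.6e}   integral bound = {rhs:.6e}   sum >= bound: {lhs >= rhs}")
```

The inequality holds here because the tail is non-increasing, so each summand dominates the integral of the tail over the following unit interval.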

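Proof of Lemma 4. Consider $\theta : \mathbb{R}^+ \to \mathbb{R}^+$. From Lemma 8, we have that: for given $\epsilon > 0$, 
if $x\theta(x) \nearrow \infty$, then there exists $x_\epsilon$ such that for all $x > x_\epsilon$, 

\[
\int_{-\infty}^{x} e^{\theta(x)u} F(du) \le 1 + c\theta^{1+\delta}(x) + e^{2\alpha}\bar{F}\left(\frac{2\alpha}{\theta(x)}\right) + e^{\theta(x)x}\bar{F}(x)(1 + \epsilon),
\]

for some $\delta > 0$. For this, we do not need any condition on the left tail as in Assumption 1. By 
definition of $\theta_k(b)$ in (20), we have $(b + n_{k-1}\mu)\theta_k(b) \nearrow \infty$, if either $b$ or $k$ grows to infinity. 
Expressing $\theta_k(b)$ as $\theta_k$, for values of $b$ and $k$ satisfying $b + n_{k-1}\mu > x_\epsilon$, we have, 

\begin{align*}
\exp\left(\Lambda_k(\theta_k)\right) &\le 1 + c\theta_k^{1+\delta} + e^{2\alpha}\bar{F}\left(\frac{2\alpha}{\theta_k}\right) + e^{\theta_k(b + n_{k-1}\mu)}\bar{F}(b + n_{k-1}\mu)(1 + \epsilon) \\
&\le \exp\left(c\theta_k^{1+\delta} + e^{2\alpha}\bar{F}\left(\frac{2\alpha}{\theta_k}\right) + \frac{1}{n_k}(1 + \epsilon)\right),
\end{align*}

because $1 + x \le e^x$ and $e^{\theta_k(b + n_{k-1}\mu)}\bar{F}(b + n_{k-1}\mu) = 1/n_k$. Then, 

\[
\exp\left(n_k\Lambda_k(\theta_k)\right) \le \exp\left(cn_k\theta_k^{1+\delta} + e^{2\alpha}n_k\bar{F}\left(\frac{2\alpha}{\theta_k}\right) + 1 + \epsilon\right). \tag{41}
\]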

Also see that, 

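if $b$ and $k$ are such that $(b + n_{k-1}\mu)$ is large enough. Similarly for given $\delta > 0$, there exists 
$x_\delta$ such that if $b + n_{k-1}\mu > x_\delta$, then 

\[
\frac{\bar{F}(2\alpha/\theta_k)}{\bar{F}(b + n_{k-1}\mu)} = \frac{\bar{F}\left(\frac{2\alpha(b + n_{k-1}\mu)}{-\log\left(n_k\bar{F}(b + n_{k-1}\mu)\right)}\right)}{\bar{F}(b + n_{k-1}\mu)} \le \left(\frac{1}{2\alpha}\log\left(\frac{1}{n_k\bar{F}(b + n_{k-1}\mu)}\right)\right)^{\alpha+\delta}.
\]

Then for values of $b$ and $k$ such that $(b + n_{k-1}\mu)$ is large enough, 

\begin{align*}
n_k\bar{F}\left(\frac{2\alpha}{\theta_k}\right) &\le n_k\bar{F}(b + n_{k-1}\mu)\left(\frac{1}{2\alpha}\log\left(\frac{1}{n_k\bar{F}(b + n_{k-1}\mu)}\right)\right)^{\alpha+\delta} \\
&= \frac{n_kL(b + n_{k-1}\mu)}{(b + n_{k-1}\mu)^{\alpha}}\left(\frac{1}{2\alpha}\log\left(\frac{1}{n_k\bar{F}(b + n_{k-1}\mu)}\right)\right)^{\alpha+\delta} \le \epsilon,
\end{align*}
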

because $\alpha > 2$. Combining this with (41) and (42), for $b$ and $k$ such that $b + n_{k-1}\mu$ is sufficiently 
large, 

\[
\exp\left(n_k\Lambda_k(\theta_k)\right) \le \exp(1 + 3\epsilon),
\]

thus establishing the claim. □ 
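Proof of Lemma 5. From the uniform asymptotic (8), given $\epsilon > 0$, for $b$ large enough and 
for any $k \ge 1$, we have: 

\begin{align*}
\frac{n_k\bar{F}(b + n_{k-1}\mu)}{P\{n_{k-1} < \tau_b \le n_k\}} &= \frac{n_k\bar{F}(b + n_{k-1}\mu)}{\frac{1}{\mu}\int_{b+n_{k-1}\mu}^{b+n_k\mu} \bar{F}(u)\,du} \cdot \frac{\frac{1}{\mu}\int_{b+n_{k-1}\mu}^{b+n_k\mu} \bar{F}(u)\,du}{P\{n_{k-1} < \tau_b \le n_k\}} \\
&\le (1 + \epsilon)\, \frac{n_k\bar{F}(b + n_{k-1}\mu)}{\frac{1}{\mu}\bar{F}(b + n_k\mu)(n_k\mu - n_{k-1}\mu)}.
\end{align*}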



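For $k > 1$, $n_k = rn_{k-1}$; then, 

\begin{align*}
\frac{n_k\bar{F}(b + n_{k-1}\mu)}{P\{n_{k-1} < \tau_b \le n_k\}} &\le (1 + \epsilon)\, \frac{r}{r - 1}\, \frac{\bar{F}(b + n_{k-1}\mu)}{\bar{F}(b + n_k\mu)} \\
&\le (1 + \epsilon)\, \frac{r}{r - 1}\, \frac{\bar{F}(b + n_{k-1}\mu)}{\bar{F}\left(r(b + n_{k-1}\mu)\right)} \\
&\le (1 + \epsilon)^2\, \frac{r^{\alpha+1}}{r - 1},
\end{align*}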

for values of $b$ large enough, and for any $k > 1$, because of the regularly varying nature of $\bar{F}(\cdot)$. 
Also, 

\[
\frac{n_1\bar{F}(b + n_0\mu)}{P\{n_0 < \tau_b \le n_1\}} \le (1 + \epsilon)\, \frac{r\bar{F}(b)}{r\bar{F}(b + r\mu)} \le (1 + \epsilon)^2,
\]

for large values of $b$, because $\bar{F}(\cdot)$ is long-tailed. Thus for any $k$, we can find a constant $c_2$ such 
that the claim holds. □ 
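
The boundedness in $k$ of the ratio treated in this proof can be illustrated numerically. The sketch below is not the paper's computation: it assumes a Pareto tail $\bar{F}(x) = x^{-\alpha}$, a geometric grid $n_k = r^k$ for $k \ge 2$, and it replaces $P\{n_{k-1} < \tau_b \le n_k\}$ by the proxy $\frac{1}{\mu}\int_{b+n_{k-1}\mu}^{b+n_k\mu}\bar{F}(u)\,du$ suggested by the uniform asymptotic (8). For this tail, elementary monotonicity gives the cap $r^{\alpha+1}/(r-1)$ on the proxy ratio; that cap is specific to this example and should not be read as the constant $c_2$ of the lemma.

```python
def tail(x, alpha):
    """Hypothetical Pareto tail F-bar(x) = x^(-alpha) for x >= 1."""
    return x ** (-alpha) if x >= 1.0 else 1.0

def integrated_tail(x, alpha):
    """Integral of F-bar over [x, infinity), valid here for x >= 1."""
    return x ** (1.0 - alpha) / (alpha - 1.0)

def proxy_ratio(k, b, mu, r, alpha):
    """n_k * F-bar(b + n_{k-1}*mu) divided by (1/mu) * integral of F-bar
    over [b + n_{k-1}*mu, b + n_k*mu], with the assumed grid n_k = r**k."""
    n_prev, n_cur = r ** (k - 1), r ** k
    lo, hi = b + n_prev * mu, b + n_cur * mu
    proxy = (integrated_tail(lo, alpha) - integrated_tail(hi, alpha)) / mu
    return n_cur * tail(lo, alpha) / proxy

# Hypothetical parameter values.
r, alpha, b, mu = 2, 2.5, 50.0, 1.0
ratios = [proxy_ratio(k, b, mu, r, alpha) for k in range(2, 30)]
cap = r ** (alpha + 1) / (r - 1)
print(f"max ratio over k = {max(ratios):.4f}   cap r^(alpha+1)/(r-1) = {cap:.4f}")
```

The ratios stay below the cap for every `k` tried, which mirrors the role regular variation plays in the argument above.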